(19) 




(12) 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets (11) EP 0 148 605 B2 

NEW EUROPEAN PATENT SPECIFICATION 



(45) Date of publication and mention 
of the opposition decision: 
23.12.1998 Bulletin 1998/52 

(45) Mention of the grant of the patent: 
25.07.1990 Bulletin 1990/30 



(51) Intel©: C12N 15/00, C12P 21/02, 
C12N1/00, C12N5/00, 
C12Q 1/68, C07K 14/00, 
C07K2/00, A61K 38/08 



(21) Application number: 84308654.7 

(22) Date of filing: 12.12.1984 



(54) Production of erythropoietin 

Herstellung von Erythropoietin 
Production d'6rythropoi6tine 



(84) Designated Contracting States: 

AT BE CH DE FR GB IT LI LU NL SE 

(30) Priority: 13.12.1983 US 561024 
21.02.1984 US 582185 
28.09.1984 US 655841 
30.11.1984 US 675298 

(43) Date of publication of application: 
17.07.1985 Bulletin 1985/29 

(73) Proprietor Klrln-Amgen, Inc. 
Thousand Oaks California 91320 (US) 

(72) Inventor: Lin, Fu-Kuen 
California 91 360 (US) 

(74) Representative: Brown, John David et al 
FORRESTER & BOEHMERT 

Franz- Joseph-Strasse 38 
80801 Munchen (DE) 



(56) References cited: 
WO-A-82/02060 
FR-A- 2 501 692 
US-A-4 254 095 



WO-A-85/03079 
GB-A- 2 085 887 



CM 

CD 
o 

CD 

00 
*T 

O 

CL 
LU 



PROC. NATL. ACAD. SCI. USA, vol. 80, June 
1983, pages 3651-55; US J.M. SUE et al.: 
"Site-specific antibodies to human 
erythropoietin directed toward the NH2-terminal 
region." 



CHEMICAL ABSTRACTS, vol. 87, 1977, page 294, 
ref. no. 129775g; Columbus, Ohio, US. T. 
Ml YAK E et al.: "Purification of human 
erythropoietin." 

BIOMED. BIOCHIM. ACTA, vol. 42, no. 11/12, 
1983, pages 202-206. R. SASAKI et al.: "Isolation 
of erythropoietin by monoclonal antibody." 
PROC. NATL. ACAD. SCI. USA, vol. 80, February 
1983, pages 802-806; US. J. SINGER-SAM et al.: 
"Isolation of a cDNA clone for human X-linked 
3-phosphoglycerate kinase by us of a mixture of 
synthetic oligo-deoxy ribonucleotides as a 
detection probe." 

BIOLOGICAL ABSTRACTS/RRM, abstract 
27023643; Biosciences Information Service, 
Philadelphia, US. N. FARBER et al.: "Translation 
of messenger RNA from human kidneys into 
biologically active erythropoietin following 
micro Injection into Xenopus-Laevls oocytes." 
PROC. NATL. ACAD. SCI. USA, vol. 81 , May 1 984, 
pages 2708-2712; US. S. LEE-HUANG: "Cloning 
and expression of human erythropoietin cDNA 
in Excherichfa coli." 

JOURNAL CELLULAR BIOCHEMISTRY, Suppl. 
8B, 1984, Abstracts of the 13th Annual UCLA 
Symposium, March 31 - April 29, 1984, page 45, 
ref. no. 0951 

F K. LIN et al.: "Cloning of the monkey 
erythropoietin gene." 

PROC. NATL. ACAD. SCI. USA, vol. 82, no. 22, 
November 1985, pages 7580-7584, US. F.K. LIN 
et al.: "Cloning and expression of the human 
erythropoietin gene." 



Printed by Jouve, 75001 PARIS (FR) 



EP 0 148 605 B2 



10 



Description 

Background 

The present invention relates generally to the manipulation of genetic materials and, more particularly, to recom- 
binant procedures making possible the production of polypeptides possessing part or all of the primary structural con- 
formation. 

A. Manipulation Of Genetic Materials 



Genetic materials may be broadly defined as those chemical substances which program for and guide the manu- 
facture of constituents of cells and viruses and direct the responses of cells and viruses. A long chain polymeric sub- 
stance known as deoxyribonucleic acid (DNA) comprises the genetic material of all living cells and viruses except for 
certain viruses which are programmed by ribonucleic acids (RNA). The repeating units in DNA polymers are four 

is different nucleotides, each of which consists of either a purine (adenine or guanine) or a pyrimidine (thymine or cytosine) 
. bound to a deoxyribose sugar to which a phosphate group is attached. Attachment of nucleotides in linear polymeric 
form is by means of fusion of the 5' phosphate of one nucleotide to the 3' hydroxy! group of another. Functional DNA 
occurs in the form of stable double stranded associations of single strands of nucleotides (known as deoxyoligonucle- 
otides), which associations occur by means of hydrogen bonding between purine and pyrimidine bases [i.e., "comple- 

20 mentary" associations existing between adenine (A) and thymine (T) or guanine (G) and cytosine (C)]. By convention, 
nucleotides are referred to by the names of their constituent purine or pyrimidine bases, and the complementary as- 
sociations of nucleotides in double stranded DNA (i.e., A — T and G — C) are referred to as "base pairs 0 . Ribonucleic 
acid is a polynucleotide comprising adenine, guanine, cytosine and uracil (U), rather than thymine, bound to ribose 
and a phosphate group. 

25 Most briefly put, the programming function of DNA is generally effected through a process wherein specific DNA 

nucleotide sequences (genes) are "transcribed" into relatively unstable messenger RNA (mRNA) polymers. The mRNA, 
in turn, serves as a template for the formation of structural, regulatory and catalytic proteins from amino acids. This 
mRNA "translation" process involves the operations of small RNA strands (tRNA) which transport and align individual 
amino acids along the mRNA strand to allow for formation of polypeptides in proper amino acid sequences. The mRNA 

30 "message", derived from DNA and providing the basis for the tRNA supply and orientation of any given one of the 
twenty amino acids for polypeptide "expression", is in the form of triplet "codons" — sequential groupings of three 
nucleotide bases. In one sense, the formation of a protein is the ultimate form of "expression" of the programmed 
genetic message provided by the nucleotide sequence of a gene. 

"Promoter" DNA sequences usually "precede" a gene in a DNA polymer and provide a site for initiation of the 

35 transcription into mRNA. "Regulator" DNA sequences, also usually "upstream" of (i.e., preceding) a gene in a given 
DNA polymer, bind proteins that determine the frequency (or rate). of transcriptional initiation. Collectively referred to 
as "promoter/regulator" or "control" DNA sequence, these sequences which precede a selected gene (or series of 
genes) in a functional DNA polymer cooperate to determine whether the transcription (and eventual expression) of a 
gene will occur. DNA sequences which "follow" a gene in a DNA polymer and provide a signal for termination of the 

40 transcription into mRNA are referred to as transcription "terminator" sequences. 

A focus of microbiological processing for the last decade has been the attempt to manufacture industrially and 
pharmaceutical^ significant substances using organisms which either do not initially have genetically coded information 
concerning the desired product included in their DNA, or (in the case of mammalian cells in culture) do not ordinarily 
express a chromosomal gene at appreciable levels. Simply put, a gene that specifies the structure of a desired polypep- 

45 tide product is either isolated from a "donor" organism or chemically synthesized and then stably introduced into another 
organism which is preferably a self-replicating unicellular organism such as bacteria, yeast or mammalian cells in 
culture. Once this is done, the existing machinery for gene expression in the "transformed" or "transfected" microbial 
host cells operates to construct the desired product, using the exogenous DNA as a template for transcription of mRNA 
which is then translated into a continuous sequence of amino acid residues. 

so The art is rich in patent and literature publications relating to "recombinant DNA" methodologies for the isolation, 

synthesis, purification and amplification of genetic materials for use in the transformation of selected host organisms. 
U.S. Letters Patent No. 4,237,224 to Cohen, at al., for example, relates to transformation of unicellular host organisms 
with "hybrid" viral or circular plasmid DNA which includes selected exogenous DNA sequences. The procedures of the 
Cohen, at al. patent first involve manufacture of a transformation vector by enzymatically cleaving viral or circular 

55 plasmid DNA to form linear DNA strands. Selected foreign ("exogenous" or "heterologous") DNA strands usually in- 
cluding sequences coding for desired product are prepared in linear form through use of a similar enzymes. The linear 
viral or plasmid DNA is incubated with the foreign DNA in the presence of ligating enzymes capable of effecting a 
restoration process and "hybrid" vectors are formed which include the selected exogenous DNA segment "spliced" 
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into the viral or circular DNA plasmid: 

Transformation of compatible unicellular host organisms with the hybrid vector results in the formation of multiple 
copies of the exogenous DNA in the host cell population. In some instances, the desired result is simply the amplification 
of the foreign DNA and the 'product" harvested is DNA. More frequently, the goal of transformation is the expression 
5 by the host cells of the exogenous DNA in the form of large scale synthesis of isolatable quantities of commercially 
significant protein or polypeptide fragments coded for by the foreign DNA. See also, e.g., U.S. Letters Patent Nos. 
4,264,731 (to Shine), 4,273,875 (to Manis), 4,293,652 (to Cohen), and European Patent Application 093,61 9, published 
November 9, 1983. 

The development of specific DNA sequences for splicing into DNA vectors is accomplished by a variety of tech- 

10 niques, depending to a great deal on the degree of "foreignness" of the 'donor" to the projected host and the size of 
the polypeptide to be expressed in the host. At the risk of over-simplification, it can be stated that three alternative 
principal methods can be employed: (1) the "isolation" of double-stranded DNA sequence from the genomic DNA of 
the donor; (2) the chemical manufacture of a DNA sequence providing a code for a polypeptide of interest; and (3) the 
in vitro synthesis of a double-stranded DNA sequence by enzymatic "reverse transcription" of mRNA isolated from 

75 donor cells. The last-mentioned methods which involve formation of a DNA •complement" of mRNA are generally 
referred to as "cDNA" methods. 

Manufacture of DNA sequences is frequently the method of choice when the entire sequence of amino acid residues 
of the desired polypeptide product is known. DNA manufacturing procedures of co-owned, co-pending U.S. Patent 
Application Serial No. 483,451 , by Alton, at al,, (filed April 1 5, 1 983 and corresponding to PCT US83/00605, published 

20 November 25, 1 983 as W083/04053), for example, provide a superior means for accomplishing such highly desirable 
results as: providing for the presence of alternate codons commonly found in genes which are highly expressed in the 
host organism selected for expression (e.g., providing yest or £ coli "preference" codons); avoiding the presence of 
untranslated "intron" sequences (commonly present in mammalian genomic DNA sequences and mRNA transcripts 
thereof) which are not readily processed by procaryotic host cells; avoiding expression of undesired "leader" polypep- 

25 tide sequences commonly coded for by genomic DNA and cDNA sequences but frequently not readily cleaved from 
the polypeptide of interest by bacterial or yeast host ceils; providing for ready insertion of the DNA in convenient 
expression vectors in association with desired promotor/regulator and terminator sequences; and providing for ready 
construction of genes for polypeptide fragments and analogs of the desired polypeptides. 

When the entire sequence of amino acid residues of the desired polypeptide is not known, direct manufacture of 

30 DNA sequences is not possible and isolation of DNA sequences coding for the polypeptide by a cDN A method becomes 
the method of choice despite the potential drawbacks in ease of assembly of expression vectors capable of providing 
high levels of microbial expression referred to above. Among the standard procedures for isolating cDNA sequences 
of interest is the preparation of plasmid-borne cDNA "libraries" derived from reverse transcription of mRNA abundant 
in donor cells selected as responsible for high level expression of genes (e.g., libraries of cDNA derived from pituitary 

35 cells which express relatively large quantities of growth hormone products). Where substantial portions of the polypep- 
tide's amino acid sequence are known, labelled, single-stranded DNA probe sequences duplicating a sequence puta- 
tively present in the "target" cDNA may be employed in DNA/DNA hybridization procedures carried out on cloned 
copies of the cDNA which have been denatured to single stranded form. [See, generally, the disclosure and discussions 
of the art provided in U.S. Patent No. 4,394,443 to Weissman, at al. and the recent demonstrations of the use of long 

40 oligonucleotide hybridization probes reported in Wallace, at al., Nuc. Acids Res., 6, pp. 3543— 3557 (1 979), and Reyes, 
at al., RN.A.S. (U.S.A.), 79, pp. 3270—3274 (1982), and Jaye, at al., Nuc. Acids Res., 11, pp. 2325—2335 (1983). 
See also, U.S. Patent No. 4,358,535 to Falkow, at at., relating to DNA/DNA hybridization procedures in effecting diag- 
nosis; published European Patent Application Nos. 0070685 and 0070687 relating to light-emitting labels on single 
stranded polynucleotide probes; Davis, at al., "A Manual for Genetic Engineering, Advanced Bacterial Genetics", Cold 

45 Spring Harbor Laboratory, Cold Spring Harbor, N.Y (1980) at pp. 55—58 and 174—176, relating to colony and plaque 
hybridization techniques; and, New England Nuclear (Boston, Mass.) brochures for "Gene Screen" Hybridization Trans- 
fer Membrane materials providing instruction manuals for the transfer and hybridization of DNA and RNA, Catalog No. 
NEF-972] 

Among the more significant recent advances in hybridization procedures for the screening of recombinant clones 
50 is the use of labelled mixed synthetic oligonucleotide probes, each of which is potentially the complete complement of 
a specific DNA sequence in the hybridization sample including a heterogenous mixture of single stranded DNAs and 
RNAs. These procedures are acknowledged to be especially useful in the detection of cDNA clones derived from 
sources which provide extremely low amounts of mRNA sequences for the polypeptide of interest. Briefly put, use of 
stringent hybridization conditions directed toward avoidance of non-specific binding can allow, e.g., for the autoradio- 
55 graphic visualization of a specific cDNA clone upon the event of hybridization of the target DNA to that single probe 
within the mixture which is its complete complement. See generally, Wallace, at al., Nuc. Acids Res., 9, pp. 879 — 897 
(1981); Suggs, at al. RN.A.S. (U.S.A.), 78, pp. 6613—6617 (1981); Choo, at al., Nature, 299, pp. 178—180 (1982); 
Kurachi, at al., RN.A.S. (USA), 79, pp. 6461—6464 (1982); Ohkubo, at al., RN.A.S. (U.S.A.), 80, pp. 2196—2200 
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(1983); and Kornblihtt, at al., RN.AS. (U.S.A.), 80, pp. 3218-3222 (1983). In general, the mixed probe procedures of 
Wallace, at al. (1981), supra, have been expanded upon by various workers to the point where reliable results have 
reportedly been obtained in a cDNA clone isolation using a 32 member mixed "pool" of 16-base-long (16-mer) oligo- 
nucleotide probes of uniformly, varying DNA sequences together with a single 11-mer to effect a two-site "positive" 

5 confirmation of the presence of cDNA of interest. See, Singer-Sam, at al., RN.AS. (U.S.A.), 80, pp. 802 — 806 (1 983). 
The use of genomic DNA isolates is the least common of the three above-noted methods for developing specific 
DNA sequences for use in recombinant procedures. This is especially true in the area of recombinant procedures 
directed to securing microbial expression of mammalian polypeptides and is due, principally to the complexity of mam- 
malian genomic DNA. Thus, while reliable procedures exist for developing phage-borne libraries of genomic DNA of 

10 human and other mammalian species origins [See, e.g., Lawn, at al. Cell, 15, pp. 1157 — 1174 (1978) relating to pro- 
cedures for generating a human genomic library commonly referred to as the "Maniatis Library"; Karri, at al., RN.A.S. 
(U.S.A.), 77, pp 51 72 — 51 76 (1 980) relating to a human genomic library based on alternative restriction endonuclease 
fragmentation procedure; and Blattner, at al., Science, 196, pp. 161—169 (1977) describing construction of a bovine 
genomic library) there have been relatively few successful attempts at use of hybridization procedures in isolating 

ts genomic DNA in the absence of extensive foreknowledge of amino acid or DNA sequences. As one example, Fiddes, 
at al., et al., J. Mot, and App. Genetics, 1, pp. 3 — 18 (1981) report the successful isolation of a gene coding for the 
alpha subunit of the human pituitary glycoprotein hormones from the Maniatis Library through use of a "full length" 
probe including a complete 621 base pair fragment of a previously-isolated cDNA sequence for the alpha subunit. As 
another example, Das, at al., RN.AS. (U.S.A.), 80, pp. 1531-1535 (1983) report isolation of human genomic clones 

20 for human HLA — DR using a 175 base pair synthetic oligonucleotide. Finally, Anderson, at al., RNA.S. (U.S.A.), 80, 
pp. 6838 — 6842 (1983) report the isolation of genomic clone for bovine pancreatic trypsin inhibitor (BPTI) using a 
single probe 86 base pairs in length and constructed according to the known amino acid sequence of BPTI. The authors 
note a determination of poor prospects for isolating mRNA suitable for synethesis of a cDNA library due to apparent 
low levels of mRNA in initially targeted carotid gland and lung tissue sources and then address the prospects of success 

2$ in probing a genomic library using a mixture of labelled probes, stating: "More generally, mixed-sequence oligodeox- 
ynucleotide probes have been used to isolate protein genes of unknown sequence from cDNA libraries. Such probes 
are typically mixtures of 8 — 32 oligonucleotides, 14—17 nucleotides in length, representing every possible codon 
combination for a small stretch (5 — 6 residues) of amino acid sequence. Under stringent hybridization conditions that 
discriminate against incorrectly base-paired probes, these mixtures are capable of locating specific gene sequences 

30 in clone libraries of low-to-moderate complexity. Nevertheless, because of their short length and heterogeneity, mixed 
probes often lack the specificity required for probing sequences as complex as a mammalian genome. This makes 
such a method impractical for the isolation of mammalian protein genes when the corresponding mRNAs are unavail- 
able." (Citations omitted). 

There thus continues to exist a need in the art for improved methods for effecting the rapid and efficient isolation 
35 of cDNA clones in instances where little is known of the amino acid sequence of the polypeptide coded for and where 
"enriched" tissue sources of mRNA are not readily available for use in constructing cDNA libraries. Such improved 
methods would be especially useful if they were applicable to isolating mammalian genomic clones where sparse 
information is available concerning amino acid sequences of the polypeptide coded for by the gene sought. 

40 B. Erythropoietin As A Polypeptide Of Interest 

Erythropoiesis, the production of red blood cells, occurs continuously throughout the human life span to offset cell 
destruction. Erythropoiesis is a very precisely controlled physiological mechanism enabling sufficient numbers of red 
blood cells to be available in the blood for proper tissue oxygenation, but not so many that the cells would impede 
45 circulation. The formation of red blood cells occurs in the bone marrow and is under the control of the hormone, eryth- 
ropoietin. 

Erythropoietin, an acidic glycoprotein of approximately 34,000 dalton molcular weight, may occur in three forms: 
a, p and asialo. The a and p forms differ slightly in carbohydrate components, but have the same potency, biological 
activity and molecular weight. The asialo form is an a and p form with the terminal carbohydrate (sialic acid) removed. 

so Erythropoietin is present in very low concentrations in plasma when the body is in a healthy state wherein tissues 
receive sufficient oxygenation from the existing number of erythrocytes. This normal low concentration is enough to 
stimulate replacement of red blood cells which are lost normally through aging. 

The amount of erythropoietin in the circulation is increased under conditions of hypoxia when oxygen transport by 
blood cells in the circulation is reduced. Hypoxia may be caused by loss of large amounts of blood through hemorrhage, 

ss destruction of red blood cells by over-exposure to radiation, reduction in oxygen intake due to high altitudes or prolonged 
unconsciousness^ or various forms of anemia. In response to tissues undergoing hypoxic stress, erythropoietin will 
increase red blood cell production by stimulating the conversion of primitive precursor cells in the bone marrow into 
proerythroblasts which subsequently mature, synthesize hemoglobin and are released into the circulation as red blood 
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cells. When the number of red blood cells In circulation Is greater than needed for normal tissue oxygen requirements, 
erythropoietin in circulation is decreased. 

See generally, Testa, at al., Exp. Hematol., 8(Supp: 8), 144—152 (1980); Tong, at aL, J. Biol. Chem., 256(24), 
1 266—1 2672 (1 981 ); Goldwasser, J. Cell. Physiol., 110(Supp. i), 1 33—1 35 (1 982); Finch, Blood, 60(6), 1 241 —1 246 

5 (1982); Sytowski, at al, Expt. Hematol., 8(Supp 8), 52—64 (1980); Naughton, Ann. Clin. Lab.Sci., 13(5), 432—438 
(1983); Weiss, at al., Am. J. Vet Res., 44(10), 1832—1835 (1983); Lappin, at al., Exp. Hematol., 11(7), 661—666 
(1983); Baciu, at aL, Ann. N. Y Acad. ScL, 414, 66—72 (1983); Murphy, at al., Acta. Haematologica Japonica, 46(7), 
1380—1396 (1983); Dessypris, atal., Brit. J. Haematol. ,56, 295—306 (1984); and, Emmanouel, atal., Am. J. Physiol. 
. 247 ff Pt 2), F1 68— 76(1984). 

10 Because erythropoietin is essential in the process of red blood cell formation, the hormone has potential useful 

application in both the diagnosis and the treatment of blood disorders characterized by low or defective red blood cell 
production. See, generally, Pennathur-Das, at al., Blood, 63(5), 1168—71 (1984) and Haddy, Am. Jour. Ped. Hematol./ 
Oncol., 4, 191 — 196, (1982) relating to erythropoietin in possible therapies for sickle cell disease, and Eschbach, at 
al. J. Clin. Invest, 74(2), pp. 434 — 441, (1984), describing a therapeutic regimen for uremia sheep based on in viva 

is response to erythropoietin-rich plasma infusions and proposing a dosage of 1 0 U EPO/kg.per day for 1 5 — 40 days as 
corrective of anemia of the type associated with chronic renal failure. See also, Krane, Henry Ford Hops. Med. J., 31 
177— 181 (1983). 

It has recently been estimated that the availability of erythropoietin in quantity would allow for treatment each year 
of anemias of 1 ,600,000 persons in the United States alone. See, e.g., Morrison, "Bioprocessing in Space — an Over- 
20 view", pp. 557—571 in The World Biotech Report 1984, Volume 2: USA, (Online Publications, New York, N.Y 1984). 
Recent studies have provided a basis for projection of efficacy of erythropoietin therapy in a variety of disease states, 
disorders and state of hematologic irregularity; Vedovato, at al., Acta. Haematol, 71, 21 1 — 21 3 (1 984) (beta-thalassem- 
ia); Vichinsky, at al., J. Pediatr. 105(1), 15—21 (1984) (cystic fibrosis); Cotes, at al., Brit. J. Obstet. Gyneacol. 90(4), 
304—311 (1983) (pregnancy, menstrual disorders); Haga, et al, Acta. Pediatr. Scand.,72, 827—831 (1983) (early 
25 anemia of prematurity); Claus-Walker, at al., Arch. Phys. Med. Rehabil., 65, 370—374 (1984) (spinal cord injury); 
Dunn, at al., Eur. J. Appl. Physiol., 52, 178—182 (1984), (space flight); Miller, at al., Brit J. Haematol., 52., 545—590 
(1982) (acute blood loss); Udupa, at al., J. Lab. Clin. Med., 103(4), 574 — 580 and 581—588 (1984); and Lipschitz, at 
al., Blood, 63(3), 502—509 (1983) (aging); and Dainiak, at al., Cancer, 51(6), 1101—1106 (1983) and Schwartz, at 
a!., Otolaryngol., 109, 269 — 272 (1983) (various neoplastic disease states accompanied by abnormal erythropoiesis). 
30 Prior attempts to obtain erythropoietin in good yield from plasma or urine have proven relatively unsuccessful. 

Complicated and sophisticated laboratory techniques are. necessary and generally result in the collection of very small 
amounts of impure and unstable extracts containing erythropoietin. 

U.S. Letters Patent No. 3,033,753 describes a method for partially purifying erythropoietin from sheep blood plasma 
which provides low yields of a crude solid extract containing erythropoietin. 
35 initial attempts to isolate erythropoietin from urine yielded unstable, biologically inactive preparations of the hor- 

mone. U.S. Letters Patent No: 3,865,801 describes a method of stabilizing the biological activity of a crude substance 
containing erythropoietin recovered from urine. The resulting crude preparation containing erythropoietin purportedly 
retains 90% of erythropoietin activity, and is stable. 

Another method of purifying human erythropoietin f rbm urine of patients with aplastic anemia is described in Miyake, 
40 at al., J. Biol. Chem., Vol. 252, No. 15 (August 10, 1977), pp. 5558—5564. This seven-step procedure includes ion 
exchange chromatography, ethanol precipitation, gel filtration, and adsorption chromatography, and yields a pure eryth- 
ropoietin preparation with a potency of 70,400 units/mg of protein in 21 % yield. 

U.S. Letters Patent No. 4,397,840 to Takezawa, at al., describes methods of preparing "an erythropoietin product" 
from healthy human urine specimens with weakly basic ion exchangers and proposes that the low molecular weight 
45 products obtained "have no inhibitory effects against erythropoietin. 

U.K. Patent Application No. 2,085,887 by Sugimoto, at al., published May 6, 1982, describes a process for the 
production of hybrid human lymphoblastoid cells, reporting production levels ranging from 3 to 420 Units of erythro- 
poietin per ml of suspension of cells (distributed into the cultures after mammalian host propagation containing up to 
10 7 cells per ml. At the highest production levels asserted to have been obtained, the rate of erythropoietin production 
50 could be calculated to be 40 to about 4,000 Units/10 6 cells/48 hours in in vitro culture following transfer of cells from 
in vivo propagation systems. (See also the equivalent U.S. Letters Patent No. 4,377,513.) Numerous proposals have 
. been made for isolation of erythropoietin from tissue sources, including neoplastic cells, but the yields have been quite 
low. See, e.g., Jelkman, et al., Expt Hematol., 11(7), 581—588 (1983); Tambourin, at al., P.N.A.S. (U.S.A.), 80, 
6269—6273 (1983); Katsuoka, at al., Gann, 74, 534—541 (1983); Hagiwara, at al., Blood, 63(4), 828—835 (1984); 
55 and Choppin, at al. , Blood, 64(2), 341 —347 (1 984). 

Other isolation techniques utilized to obtain purified erythropoietin involve immunological procedures. A polyclonal, 
serum-derived antibody directed against erythropoietin is developed by injecting an animal, preferably a rat or rabbit, 
with human erythropoietin. The injected human erythropoietin is recognized as a foreign antigenic substance by the 
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immune system of the animal and elicits production of antibodies against the antigen. Differing cells responding to 
stimulation by the antigenic substance produce and release into circulation antibodies slightly different from those 
produced by other responding cells. The antibody activity remains in the serum of the animal when its blood is extracted. 
While unpurified serum or antibody preparations purified as a serum immunoglobulin G fraction may then be used in 

s asssays to detect and complex with human erythropoietin, the materials suffer from a major disadvantage. This serum . 
antibody, composed of all the different antibodies produced by individual cells, is polyclonal in nature and will complex 
with components in crude extracts other than erythropoietin alone. 

Of interest to the background of the present invention are recent advances in the art of developing continuous 
cultures of cells capable of producing a single species of antibody which is specifically immunologically reactive with 

10 a single antigenic determinant of a selected antigen. See, generally, Chisholm, High Technology, Vol. 3, No. 1 , 57 — 63 
(1983). Attempts have been made to employ cell fusion and hybridization techniques to develop "monoclonal" anti- 
bodies to erythropoietin and to employ these antibodies in the isolation and quantitative detection of human erythro- 
poietin. As one example, a report of the successful development of mouse-mouse hybridoma cell lines secreting mon- 
oclonal antibodies to human erythropoietin appeared in abstract form in Lee-Huang, Abstract No. 1463 of Fed. Proa, 

is 41, 520 (1982). As another example, a detailed description of the preparation and use of a monoclonal, anti-erythro- 
poietin antibody appears in Weiss, atal., RN.AS. (U.S.A.), 79,5465—5469(1982). See also, Sasaki, Biomed. Biochim. 
Acta., 42(11/12), S202— S206 (1983); Yanagawa, at a!., Blood, 64(2), 357—364 (1984); Yanagawa, at al., J. Biol. 
Chem., 259(5), 2707—2710 (1 984); and U. S Letters Patent No. 4,465,624. 

Also of interest to the background of the invention are reports of the immunological activity of synthetic peptides 

20 which substantially duplicate the amino acid sequence extant in naturally-occurring proteins, glycoproteins and nucle- 
oproteins. More specifically, relatively low molecular weight polypeptides have been shown to participate in immune 
reactions which are similar in duration and extent to the immune reactions of physiologically significant proteins such 
as viral antigens, polypeptide hormones, and the like. Included among the immune reactions of such polypeptides is 
the provocation of the formation of specific antibodies in immunologically active animals. See, e.g., Lerner, at al., Cell, 

25 23, 309— 310 (1981); Ross, at al., Nature, 294, 654—656 (1981); Walter, at al., RN.AS. (U.S.A.), 77, 5197—5200 
(1980); Lerner, at al., RN.A S. (U.S.A.), 78, 3403—3407 (1981); Walter, at al., RN.A S. (U.S.A.), 78, 4882—4886 
. (1981); Wong, at al., RN.AS. (USA), 78, 7412—7416 (1981); Green at al., Cell, 28, 477—487 (1982); Nigg, at al., 
RN.AS. (USA), 79, 5322—5326 (1982); Baron, at al., Cell, 28, 395—404 (1982); Dreesman, at al., Nature, 295, 
158—160 (1982); and Lerner, Scientific American, 248, No. 2, 66—74 (1983). See, also, Kaiser, at al., Science, 223, 

30 pp. 249 — 255 (1 984) relating to biological and immunological activities of synthetic peptides which approximately share 
secondary structures of peptide hormones but may not share their primary structural conformation. The above studies 
relate, of course, to amino acid sequences of proteins other than erythropoietin, a substance for which no substantial 
amino acid sequence information has been published. In co-owned, co-pending U.S. Patent Application Serial No. 
463,724, filed Februarv 4, 1983, by J. Egrie, published August 22, 1984 as European Patent Application No. 0 116 

3S 446, there is described a mouse-mouse hybridoma cell line (A.T.C.C. No. HB8209) which produces a highly specific 
monoclonal, anti-erythropoietin antibody which is also specifically iummunoreactive with a polypeptide comprising the 
following sequence of amino acids: 
following sequence of amino acids: 

40 

NHj-Ala-Pro-Pro-Arg-Leu-lle-Cys-Asp-Ser-Arg-Val-Leu-Glu-Arg-Tyr-Leu-Lou-Glu-Ala-Lys-COOH. 

The polypeptide sequence is one assigned to the first twenty amino acid residues of mature human erythropoietin 
isolated according to the method of Miyake, at al., J. Biol. Chem., 252, 5558-5564 (1 977) and upon which amino acid 

45 analysis was performed by the gas phase sequencer (Applied Biosystems, Inc.) according to the procedure of Hewick, 
M., at al., J. Biol. Chem., 256, 7990—7997 (1981). See, also, Sue, et al., Proc. Nat. Acad. Sci. (USA), 80, pp. 
3551 — 3655 (1983) relating to development of polyclonal antibodies against a synthetic 26-mer based on a differing 
amino acid sequence, and Sytowski, at al., J. Immunol. Methods, 69, pp. 181 — 186 (1984). 

While polyclonal and monoclonal antibodies as described above provide highly useful materials for use in immu- 

so noassays for detection and quantification of erythropoietin and can be useful in the affinity purification of erythropoietin, 
it appears unlikely that these materials can readily provide for the large scale isolation of quantities of erythropoietin 
from mammalian sources sufficient for further analaysis, clinical testing and potential wide-ranging therapeutic use of 
the substance in treatment of, e.g., chronic kidney disease wherein diseased tissues fail to sustain production of eryth- 
ropoietin. It is consequently projected in the art that the best prospects for fully characterizing mammalian erythropoietin 

55 and providing large quantities of it for potential diagnostic and clinical use involve successful application of recombinant 
procedures to effect large scale microbial synthesis of the compound. 

While substantial efforts appear to have been made in attempted isolation of DNA sequences coding for human 
and other mammalian species erythropoietin, none appear to have been successful. This is due principally to the 
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scarcity of tissue sources, especially human tissue sources, enriched in mRNA such as would allow for construction 
of a cDNA library from which a DNA sequence coding for erythropoietin might be isolated by conventional techniques. 
Further, so little is known of the continuous sequence of amino acid residues of erythropoietin that it is not possible to 
construct, e.g. , long polynucleotide probes readily capable of reliable use in DMA/DN A hybridization screening of cDN A 

5 and especially genomic DNA libraries. Illustratively, the twenty amino acid sequence employed to generate the above- 
named monoclonal antibody produced by A.T.C.C. No. HB8209 does not admit to the construction of an unambiguous, 
60 base oligonucleotide probe in the manner described by Anderson, at al. , supra. It is estimated that the human gene 
for erythropoietin may appear as a "single copy gene" within the human genome and, in any event, the genetic material 
coding for human erythropoietin is likely to constitute less than 0.00005% of total human genomic DNA which would 

10 be present in a genomic library. 

To date, the most successful of known reported attempts at recombinant-related methods to provide DNA sequenc- 
es suitable for use in microbial expression of isolatable quantities of mammalian erythropoietin have fallen far short of 
the goal. As an example, Farber, at al. Exp. HematoL, 11. Supp. 14, Abstract 101 (1983) report the extraction of mRNA 
from kidney tissues of phenylhydrazine-treated baboons and the injection of the mRNA into Xenopus laevis oocytes 

15 with the rather transitory result of in vitro production of a mixture of "translation products" which included among them 
displaying biological properties of erythropoietin. More recently, Farber, at al., Blood, 62, No. 5, Supp. No. 1, Abstract 
392, at page 1 22a (1983) reported the in vitro translation of human kidney mRNA by frog ooctyes. The resultant trans- 
lation product mixture was estimated to include on the order of 220 mU of a translation product having the activity of , 
erythropoietin per microgram of injected mRNA. While such levels of in vitro translation of exogenous mRNA coding 

20 for erythropoietin were acknowledged to be quite low (compared even to the prior reported levels of baboon mRNA 
translation into the sought-for product) it was held that the results confirm the human kidney as a site of erythropoietin 
expression, allowing for the construction of an enriched human kidney cDNA library from which the desired gene might 
be isolated. [See also, Farber, Clin. Res., 31(4), 769A (1983).] 

Since the filing of U.S. Patent Application Serial Nos. 561 ,024 and 582,1 85, there has appeared a single report of 

25 the cloning and expression of what is asserted to have been human erythropoietin cDN A in E. coti. Briefly put, a number 
of cDNA clones were inserted into E. coli plasmids and (^-lactamase fusion products were noted to be immunoreactive 
with a monoclonal antibody to an unspecified "epitope" of human erythropoietin. See, Lee-Huang, Proc. Nat. Acad. 
Sci. (USA), 81, pp. 2708—2712 (1984). 

the genetic material coding for human erythropoietin is likely to constitute less than 0.00005% of total human 

30 genomic DNA which would be present in a genomic library. 

To date, the most successful of known reported attempts at recombinant-related methods to provide DNA sequenc- 
es suitable for use in microbial expression of isolatable quantities of mammalian erythropoietin have fallen far short of 
the goal. As an example, Farber, et al. Exp. HematoL, 11, Supp. 14 Abstract 101 (1983) report the extraction of mRNA 
from kidney tissues of phenylhydrazine-treated baccons and the injection of the mRNA into Xenopus laevis oocytes 

35 with the rather transitory result of in vitro production of a mixture of "translation procucts" which included among them 
displaying biological properties of erythropoietin. More recently, Farber, et al., Blood, 62, No. 5. Supp. No. 1, Abstract 
392, at page 122a (1983) reported the in vit ro translation of human kidney mRNA by frog ooctyes. The resultant trans- 
lation product: mixture was estimated to include on the order of 220 mU of a translation product having the activity of 
enyrthropoietin per microgram of injected mRNA. While such levels of in vitro translation of exogenous mRNA coding 

40 for erythropoietin were acknowledged to be cuite low (compared even to the prior reported levels of babcon mRNA 
translation into the sought-for product) it was held that the results confirm the human kidney as a site of erythropoietin 
exoression, allowing for the construction of an enriched human kidney cDNA library from which the desired gene might 
be isolated. [See also, Farber, Clin. Res., 31(4), 769A (1983).] 

Since the filing of U.S. Patent Application Serial Nos. 561 ,024 and 582,185, there has appeared a single report of 

45 the cloning and expression of what is asserted to have been human erythropoietin cDNA in E. coli. Briefly put. a number 
of cDNA clones were inserted into E. coli plasmids and p-lactamase fusion products were noted to be immunoreactive 
with a monoc tonal antibocy to an unspecified "epitope" of human erythropoietin. See, Lee-Huang, Proc Nat. Acad. 
Sci. (USA), 81, pp. 2708-271 2 ( 1 984). 

50 Brief Summary 

The present invention provides: - 

1 . A DNA sequence for use in securing expression in a procaryotic or eucaryotic host cell of a polypeptide product 
55 having at least part of the primary structural confirmation of that of erythropoietin to allow possession of the bio- 

logical property of causing bone marrow cells to increase production of reticulocytes and red blood cells and to 
increase hemoglobin synthesis or iron uptake, said DNA sequence selected from the group consisting of: 
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(a) the DNA sequences set out in Tables V and VI or their complementary strands; . 

(b) DNA sequences which hybridize under stringent conditions to the protein coding regions of the DNA se- 
quences defined in (a) or fragments thereof; and 

(c) DNA sequences which, but for the degeneracy of the genetic code, would hybridize to the DNA sequences 
defined in (a) and (b). 

2. A DNA sequence according to point 1 encoding human erythropoietin. 

3. A cDNA sequence according to point 1 being a monkey species erythropoietin coding DNA sequence. 

4. A DNA sequence according to point 3 and including the protein coding region set forth in Table V. 

5. A genomic DNA sequence according to point 1 or 2. 

6. A human species erythropoietin coding DNA sequence according to point 5. 

7. A DNA sequence according to point 6 and including the protein coding region set forth in Table VI. 

8. A DNA sequence according to point 1 or 2, covalently associated with a detectable label substance. 

9. A DNA sequence according to point 8, wherein the detectable label is a radiolabel. 

1 0. A single-strand DNA sequence according to point 8 or 9. 

11. A DNA sequence according to point 1, coding for [Phe 15 ]hEPO, [Phe 49 ]hEPO, [Phe 145 ]hEPO, [His 7 ]hEPO t 
[Asn2 des-Pro 2 through lle 6 ]hEPO, [des-Thr 16 ! through Arg 166 ]hEPO, or[A27-55]hEPO. 

12. A procaryotic or eucaryotic host cell transformed or transfected with a DNA sequence according to any one of 
points 1 , 2, 3, 6, 7 and 8, in a manner allowing the host cell to express said polypeptide product. 

13. A transformed or transfected host cell according to point 12 which host cell is capable of glycosylating said 
polypeptide. 

1 4. A transformed or transfected mammalian host cell according to point 1 3. 

15. A transformed or transfected COS cell according to point 13. 

16. Atransformed or transfected CHO cell according to point 13. 

17. A biologically functional circular plasmid or viral DNA vector including a DNA sequence according to any one 
of points 1 , 2, 3, 5, 6, 7, or 11 . 

1 8. A procaryotic or eucaryotic host cell stably transformed or transfected with a DNA vector according to point 1 7. 

1 9. A recombinant polypeptide having part or all of the primary structural conformation of human or monkey eryth- 
ropoietin as set forth in Table VI or Table V or any allelic variant or derivative thereof possessing the biological 
property of causing bone marrow cells to increase production of reticulocytes and red blood cells to increase 
hemoglobin synthesis or iron uptake and characterized by being the product of eucaryotic expression of an exog- 
enous DNA sequence and which has higher molecular weight by SDS-PAGE from erythropoietin isolated from 
urinary sources. 

20. A glycoprotein polypeptide according to point 19 having an average carbohydrate composition which differs 
from that of human erythropoietin isolated from urinary sources. 

21 . A polypeptide according to point 1 9 or 20 wherein the exogenous sequence is a cDNA sequence. 
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22. A polypeptide according to point 1 9 or 20 wherein the exogenous DNA sequence is a genomic DN A sequence. 

23. A polypeptide according to point 1 9 or 20 wherein the exogenous DNA sequence is carried on an autonomously 
replicating circular DNA plasmid or viral vector. 

24. A polypeptide according to any one of points 19 to 23 further characterized by being covalently associated 
with a detectable label substance. 

25. A polypeptide according to point 24, wherein said detectable label is a radiolabel. 

26. A polypeptide product of the expression in a eucaryotic host cell of a DNA sequence according to any of points 
1, 2, 3, 5, 6 and 7. 

27. A process for production of a polypeptide having at least part of the primary structural conformation of eryth- 
ropoietin to allow possession of the biological property of causing bone marrow cells to increase production of 
reticulocytes and red blood cells and to increase hemoglobin synthesis or iron uptake, which process is charac- 
terized by culturing under suitable nutrient conditions a procaryotic or eucaryotic host cell transformed or trans- 
fected with a DNA sequence according to any of points 1, 2, 3, 5, 6 and 7 in a manner allowing the host cell to. 
express said polypeptide; and optionally isolating the desired polypeptide product of the expression of the DNA 
sequence. 

28. A process according to point 27, characterized by culturing a host cell of any one of points 12 to 16. 

29. A process according to point 27 or 28 for production of a polypeptide of any one of points 1 9 to 23 and 26. 

30. A pharmaceutical composition comprising a polypeptide produced in accordance with the process of point 27, 
28 or 29 and a pharmaceutical^ acceptable diluent, adjuvant or carrier. 

31. A pharmaceutical composition according to point 30, comprising a polypeptide of any one of points 19 to 23 
and 26. 

Vertebrate (e.g., COS-1 and CHO) cells provided by the present invention comprise the first cells ever available 
which can be propagated in vitro continuously and which upon growth in culture are capable of producing in the medium 
of their growth in excess of 100U (preferably in excess of 500U and most preferably in excess of 1 ,000 to 5,000U) of 
erythropoietin per 10 6 cells in 48 hours as determined by radioimmunoassay. 

Also provided by the present invention are synthetic polypeptides wholly or partially duplicative of continuous 
sequences of erythropoietin amino acid residues which are herein for the first time elucidated. These sequences, by 
virtue of sharing primary, secondary or tertiary structural and conformational characteristics with naturally-occurring 
erythropoietin may possess biological activity and/or immunological properties in common with the naturally-occurring 
product such that they may be employed as biologically active or immunological substitutes for erythropoietin in ther- 
apeutic and immunological processes. 

I llustrating the present invention are cloned DNA sequences of monkey and human species origins and polypeptide 
sequences suitably deduced therefrom which represent, respectively, the primary structural conformation of erythro- 
poietins of monkey and human species origins. 

Also provided by the present invention are novel biologically functional viral and circular plasmid DNA vectors 
incorporating DNA sequences of the invention and microbial (e.g. , bacterial, yeast and mammalian cell) host organisms 
stably transformed or transfected with such vectors. Correspondingly provided by the invention are novel methods for 
the production of useful polypeptides comprising cultured growth of such transformed or transfected microbial hosts 
under conditions facilitative of large scale expression of the exogenous, vector-borne DNA sequences and isolation 
of the desired polypeptides from the growth medium, cellular lysates or cellular membrane fractions. 

Isolation and purification of microbially expressed polypeptides provided by the invention may be by conventional 
means including, e.g., preparative chromatographic separations and immunological separations involving monoclonal 
and/or polyclonal antibody preparations. 

Having herein elucidated the secuence of amino acid residues of erythropoietin, the present invention provides 
for the total and/or partial manufacture of DNA sequences coding for erythropoietin and including such advantageous 
characteristics as incorporation of codons "preferred" for expression by selected non-mammalian hosts, provision of 
sites for cleavage by restriction endonuclease enzymes and provision of additional initial, terminal or intermediate DNA 
secuences which lacilitate construction of readily expressed vectors. The Examples of the present invention provide 
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for manufacture (and development by site specific mutagenesis of monkey cDN A and genomic D N A) of DN A sequences 
coding for microbial expression of polypeptide analogs or derivatives of erythropoietin which differ from naturally-oc- 
curing forms in terms of the identity or location of one or more amino acid residues (i.e., deletion analogs containing 
less than all of the residues specified for EPO and/or substitution analogs wherein one or more residues specified are 
5 replaced by other residues and/or addition analogs wherein one or more amino acid residues is added to a terminal 
or medial portion of the polypeptide); and which share some or all the properties of naturally-occurring forms. 

Novel DNA sequences of the invention include all sequences useful in securing expression in procaryotic or eu- 
caryotic host cells of polypeptide products having at least a part of the primary structural conformation and one or more 
of the biological properties of erythropoietin which are comprehended by: (a) the DNA sequences set out in Tables V 
10 and VI herein or their complementary strands; (b) DNA seqeuences which hybridize (under hybridization conditions 
such as illustrated herein or more stringent conditions) to DNA sequences defined in (a) or fragments thereof; and (c) 
DNA sequences which, but for the degeneracy of the genetic code, would hybridize to DNA sequences define in (a) 
and (b) above. Specifically comprehended in part (b) are genomic DNA sequences encoding allelic variant forms of 
monkey and human erythropoietin and/or encoding other mammalian species of erythropoietin. Specifically compre- 
ss hended by part (c) are manufactured DNA sequences encoding EPO, EPO fragments and EPO analogs which DNA 
sequences may incorporate codons facilitating translation of messenger RNA in non -vertebrate hosts. 

Comprehended by the present invention is that class of polypeptides coded for by portions of the DNA complement 
to the top strand human genomic DNA sequence of Table VI herein, i.e., "complementary inverted proteins" as described 
by Tramontane et al., Nucleic Acids Research, 12, pp. 5049-5059 (1984). 
20 Also comprehended by the invention are pharmaceutical compositions comprising effective amounts of polypeptide 

products of the invention together with suitable pharmaceutical^ acceptacle diluent(s), adjuvant(s) and/or carrier(s) 
which allow for provision of erythropoietin therapy, especially in the treatment of anemic disease states and most 
especially such anemic states as attend chronic renal failure. 

Polypeptide products of the invention may be "labelled" by covalent association with a detectable marker substance 
25 (e.g.. radiolabeled with 125 l) to provide reagents useful in detection and quantification of erythropoietin in solid tissue 
and fluid samples such as blood or urine. DNA products of the invention may also be labelled with detectable markers 
(such as radiolabels and non-isotopic labels such as biotin) and employed in DNA hybridization processes to locate 
the erythropoietin gene position and/or the position of any related gene family in the human, monkey and other mam- 
malian species chromosomal map. They can also be used for identifying the erthropoietin gene disorders at the DNA 
30 level and used as gene markers for identifying neighboring genes and their disorders. 

As hereinafter described in detail, the present invention further provides significant improvements in methods for 
detection of a specific single stranded polynucleotide of unknown sequence in a heterogeneous cellular or viral sample 
including multiple single-stranded polynucleotides where 

35 (a) a mixture of labelled single-stranded polynucleotide probes is prepared having uniformly varying sequences 

of bases, each of said probes being potentially specifically complementary to a sequence of bases which is puta- 
tively unique to the polynucleotide to be detected, 

(b) the sample is fixed to a solid substrate, 

(c) the substrate having the sample fixed thereto is treated to diminish further binding of polynucleotides thereto 
40 except by way of hybridization to polynucleotides in said sample, 

(d) the treated substrate having the sample fixed thereto is transitorily contacted with said mixture of labelled 
probes under conditions facilitative of hybridization only between totally complementary polynucleotides, and 

(e) the specific polynucleotide is detected by monitoring for the presence of a hybridization reaction between it 
and a totally complementary probe within said mixture ol labelled probes, as evidenced by the presence of a higher 

45 density of labelled material on the substrate at the locus of the specific polynucleotide in comparison to a back- 

ground density of labelled material resulting from non-specific binding of labelled probes to the substrate. 

The procedures are especially effective in situations dictating use of 64, 128, 256, 512, 1024 or more mixed poly- 
nucleotide probes having a length of 17 to 20 bases in DNA/DNA or RNA/RNA or DNA/RNA hybridizations. 

so As described infra, the above-noted improved procedures have illustratively allowec for the identification of cDN A 

clones coding for erythropoietin of monkey species origins within a library prepared from anemic monkey kidney cell 
mRN A. More specifically, a mixture of 1 28 uniformly varying 20-mer probes based on amino acid sequence information 
derived from sequencing fractions of human erythropoietin was employed in colony hybridization procedures to identify 
seven "positive" erythropoietin monkey cDNA clones within a total of 200,000 colonies. Even more remarkably, practice 

55 of the improved procedures of the invention have allowed for the rapid isolation of three positive clones from within a 
screening of 1 ,500,000 phage plaques constituting a human genomic library. This was accomplished through use of 
the above-noted mixture of 1 28 20-mer probes together with a second set of 1 28 17-mer probes based on amino acid 
analysis of a different continuous sequence of human erythropoietin. 
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The above-noted illustrative procedures constitute the first known instance of the use of multiple mixed oligonu- 
cleotide probes in DNA/DNA hybridization processes directed toward isolation of mammalian genomic clones and the 
first known instance of the use of a mixture of more than 32 oligonucleotide probes in the isolation of cDNA clones. 

Numerous aspects and advantages of the invention will be apparent to those skilled in the art upon consideration 
5 of the following detailed description which provides illustrations of the practice of the invention in its presently preferred 
embodiments. 

Detailed Description 

10 According to the present invention, DNA sequences encoding part or all of the polypeptide sequence of human 

and monkey species erythropoietin (hereinafter, at times, "EPO") have been isolated and characterized. Further, the 
monkey and human origin DNA has been made the subject of eucaryotic and procaryotic expression providing isolatable 
quantities of polypeptides displaying biological (e.g.. immunological) properties of naturally-occurring EPO as well as 
both in vivo and in vitro biological activities of EPO. 

is The DNA of monkey species origins was isolated from a cDNA library constructed with mRNA derived from kidney 

tissue of a monkey in a chemically induced anemic state and whose serum was immunologically determined to include 
high levels of EPO compared to normal monkey serum. The isolation of the desired cDNA clones containing EPO 
encoding DNA was accomplished through use of DNA/DNA colony hybridization employing a pool of 128 mixed, radi- 
olabeled, 20-mer oligonucleotide probes and involved the rapid screening of 200,000 colonies. Design of the oligonu- 

20 cleotide probes was based on amino acid sequence information provided by enzymatic fragmentation and sequencing 
a small sample of human EPO. 

The DNA of human species origins was isolated from a human genomic DNA library. The isolation of clones con- 
taining EPO-encoding DNA was accomplished through DNA/DNA plaque hybridization employing the above-noted 
pool of 128 mixed 20-mer oligonucleotide probes and a second pool of 128 radiolabelled 17-mer probes whose se- 

2S quences were based on amino acids sequence information obtained from a different enzymatic human EPO fragment. 

Positive colonies and plaques were verified by means of dideoxy sequencing of clonal DNA using a subset of 16 
sequences within the pool of 20-mer probes and selected clones were subjected to nulceotide sequence analysis 
resulting in deduction of primary structural conformation of the EPO polypeptides encoded thereby The deduced 
polypeptide sequences displayed a high degree of homology to each other and to a partial sequence generated by 

30 amino acid analysis of human EPO fragments. 

A selected positive monkey cDNA clone and a selected positive human genomic clone were each inserted in a 
"shuttle" DNA vector which was amplified in E.coli and employed to transfect mammalian cells in culture. Cultured 
growth of transfected host cells resulted in culture medium supernatant preparations estimated to contain as much as 
3000 mil of EPO per ml of culture fluid. 

35 The following examples are presented by way of illustration of the invention and are specifically directed to proce- 

cures carried out prior to identification of EPO encoding monkey cDNA clones and human genomic clones, to proce- 
dures resulting in such identification, and to the sequencing, development of expression systems and immunological 
verification of EPO expression in such systems. 

More particularly, Example 1 is directed to amino acid sequencing of human EPO fragments and construction of 

40 mixtures of radiolabelled probes based on the results of this sequencing. Example 2 is generally directed to procedures 
involved in the identification of positive monkey cDNA clones and thus provides information concerning animal treat - 
. ment and preliminary radioimmunoassay (RIA) analysis of animal sera. Example 3 is directed to the preparation of the 
cDNA library, colony hybridization screening and verification of positive clones, DNA sequencing of a positive cDNA 
clone and the generation of monkey EPO polypeptides primary structural conformation (amino acid sequence) infor- 
ms mation. Example 4 is directed to procedures involved in the identification of positive human genomic clones and thus 
provides information concerning the source of the genomic library, plaque hybridization procedures and verification of 
positive clones. Example 5 is directed to DNA sequencing of a positive genomic clone and the generation of human 
EPO polypeptide amino acid sequence information including a comparison thereof to the monkey EPO sequence 
information. Example 6 is directed io procedures for construction of a vector incorporating EPO-encoding DNA derived 

50 from a positive monkey cDNA clone, the use of the vector for transfect ion ol COS-1 cells and cultured growth of the 
transfected cells. Example 7 is directed to procedures for construction of a vector incorporating EPO-encoding DNA 
derived from a positive human genomic clone, the use of the vector for transfection of COS-1 cells and the cultured 
growth of the transfected cells. Example 8 is directed to immunoassay procedures performed on media supernatants 
obtained from the cultured growth of transfected cells according to Example 6 and 7. Example 9 is directed to in vitro 

55 and in vivo biological activity of microbially expressed EPO of Examples 6 and 7. 

Example 10 is directed to a development of mammalian host expression systems for monkey species EPO cDNA 
and human species genomic DNA involving Chinese hamster ovary ("CHO") cells and to the immunological and bio- 
logical activities of products of these expression systems as well as characterization of such products. Example 11 is 
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15 



directed to the preparation of manufactured genes encoding human species EPO and EPO analogs, which genes 
include a number of preference condons for expression in E. colt and yeast host cells, and to expression systems 
based thereon. Example 12 relates to the immunological and biological activity profiles of expression products of the 
systems of Example II. 

Example 1 

A. Human EPO Fragment Amino Acid Sequencing 

Human EPO was isolated from urine and subjected to tryptic digestion resulting in the development and isolation 
of 17 discrete fragments in quantities approximating 100 — 150 picomoles. 

Fragments were arbitrarily assigned numbers and were analyzed for amino acid sequence by microsequence 
analysis using a gas phase sequencer (Applied Biosystems) to provide the sequence information set out in Table I, 
below, wherein single letter codes are employed and °X" designates a residue which was not unambiguously deter- 
mined. 



TA8LE I 



20 



Fragment No, 



Sequence Analysis Result 



25 



30 



35 



40 



45 



SO 



T4a 

TAb 

T9 . 

T13 

T16 

T18 

T21 

J25' 

T26a 

T26b 

T27 

T28 

T30 

T31 
*T33 
T35 
T38 



A-P-P-R 

G-K-L-K 

A-L-G-A-Q-K 

V-L-E-R 

A-V-S-G-L-R 

L-F-R 

K-L-F.R 

Y-t-L-E-A-K 

L-I-C-0-S-R 

L-Y-T-G-E-A-C-R 

T-I-T-A-CUT-F-R 

E-A-I-5-P-P-0-A-A-M-A-A-P-L-R 

E-A-E-X-I-T-T-G-X-A-E-H-X-S-l- 

N-E-X-I-T-V-P 
V-Y-S-N-F-L-R 
S-L-T-T-L-L-R 
V-N-F-Y-A-W-K 
G-Q-A-L-L-V-X-S.S-Q-P-W- 

E-P-L-Q-L-H-V-D-K 



55 



B. Design and Construction of Oligonucleotide Probe Mixtures 

The amino acid sequences set out in Table I were reviewed in the context of the degeneracy of the genetic code 
for the purpose of ascertaining whether mixed probe procedures could be applied to DNA/DN A hybridization procedures 
on cDNA and/or genomic DNA libraries. This analysis revealed that within Fragment No. T35 there existed a series of 
7 amino acid residues (Val-Asn-Phe-Tyr-Ala-Trp-Lys) which could be uniquely characterized as encoded for by one of 
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128 possible DNA sequences spanning 20 base pairs. A first set of 128 20-mer oligonucleotides was therefore syn- 
thesized by standard phosphoamidite methods (See, e.g., Beaucage, at al., Tetrahedron Letters, 22, pp. 1859—1862* 
(1981) on a solid support according to the sequence set out in Table II, below. 

5 TABLE II 

Residue - Val - Asn Phe Tyr Ala Trp Lys 

10 y CAA TTG AAG ATG CGA ACC TT -5' 

T A A A T 

G G 

C C. 



75 



Further analysis revealed that within fragment No. T38 there existed a series of 6 amino acid residues (Gln-Pro- 
Trp-Glu-Pro-Leu) on the basis of which there could be prepared a pool of 128 mixed oligonucleotide 17-mer probes 
as set out in Table III below 



20 



TABLE III 



Residue - Gin Pro Trp Glu Pro Leu 

3' GTT GGA ACC CTT GGA GA - 5' 
25 CI C T A 

G G 
C C 



so Oligonucleotide probes were labelled at the 5 1 end with gamma — 32 P — ATP, 7500—8000 Ci/mmole (ICN) using 

T, polynucleotide kinase (NEN). 

Example 2 

35 A. Monkey Treatment Procedures and RIA Analysis 

Female Cynomolgus monkeys Macaca fascicularias (2.5 — 3 kg, 1 .5—2 years old) were treated subcutaneously 
with a pH 7.0 solution of phenylhydrazine hydrochloride at a dosage level of 12.5 mg/kg on days 1, 3 and 5. The 
hematocrit was monitored prior to each injection. On day 7, or whenever the hematocrit level fell below 25% of the 
40 initial level, serum and kidneys were harvested after administration of 25 mg/kg doses of ketamine hydrochloride. 
Harvested materials were immediately frozen in liquid nitrogen and stored at -70°C. 

B. RIA for EPO 

45 Radioimmunoassay procedures applied for quantitative detection of EPO in samples were conducted according 

to the following procedures: * 

An erythropoietin standard or unknown sample was incubated together with antiserum for two hours at S7°C. After 
the two hour incubation, the sample tubes were cooled on ice, 125 l-labelled erythropoietin was added, and the tubes 
were incubated at 0°C for at least 15 more hours. Each assay tube contained 500 uJ of incubation mixture consisting 

so of 50 uJ of diluted immune sera, 10,000 cam of 125 l -erythropoietin, 5 u.l trasylol and 0 — 250 \x\ of either EPO standard 
or unknown sample, with PBS containing 0.1% BSA making up the remaining volume. The antiserum used was the 
second test bleed of a rabbit immunized with a 1% pure preparation of human urinary erythropoietin. The final antiserum 
dilution on the assay was adjusted so that the antibody4)ound 125 l-EPO did not exceed 10 — 20% of the input total 
counts. In general, this corresponded to a final antiserum dilution of from 1:50,000 to 1:100,000. 

ss The antibody-bound 125 l -erythropoietin was precipitated by the addition of 1 50 uJ Staph A. After a 40 min. incuba- 

tion, the samples were centrifuged and the pellets were washed two times with 0.75 ml 10 mM Tris-HCI pH 8.2 con- 
taining 0.1 5M NaCI, 2mM EDTA, and 0.05% Triton X-100. The washed pellets were counted in a gamma counter to 
determine the percent of 125 l-erythropoietin bound. Counts bound by pre-immune sera were subtracted from all final 
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values to correct for nonspecific precipitation. The erythropoietin content of the unknown samples was determined by 
comparison to the standard curve. 

The above procedure was applied to monkey serum obtained in Part A, above, as well as to the untreated monkey 
serum. Normal serum levels were assayed to contain approximately 36 mU/ml while treated monkey serum contained 
s from 1000 to 1700 mU/ml. 

Example 3 

A. Monkey cDNA Library Construction 

10 

Messenger RNA was isolated from normal and anemia monkey kidneys by the guanidinium thiocyanate procedure 
of Chirgwin, ataL, Biochemistry, 18, p. 5294 (1979) and poly (A) + mRNA was purified by two runs of oligo(dT)-ceullose 
column chromatography, as described at pp. 197-198 in Maniatis, at al, "Molecular Cloning, A Laboratory Manual" 
(Cold Springs Harbor Laboratory, Cold Springs, Harbor, N.Y., 1982). The cDNA library was constructed according to 

is a modification of the general procedures of Okayama, at al., Mol and Cell Biol, 2, pp. 161—170 (1982). The key 
features of the presently preferred procedures were as follows: (1) pUC8 was used as the sole vector, cut with Pstl 
and then tailed with oligo dT of 60 — 80 bases in length; (2) Hindi digestion was used to remove the oligo dT tail from 
one end of the vector; (3) first strand synthesis and oligo dG tailing was carried out according to the published procedu re; 
(4) BarrMX digestion was employed to remove the oligo dG tail from one end of the vector; and (5) replacement of the 

20 RNA strand by DNA was in the presence of two linkers 

(GATCTAAAGACCGTCCCCCCCCC and ACGGTCTTTA) 

25 jn a three-fold molar excess over the oligo dG tailed vector. 

B. Colony Hybridization Procedures For Screening Monkey cDNA Library 

Transformed E. co// were spread out at a density of 9000 colonies per 1 0 X 1 0 cm plate on nutrient plates containing 
30 50 micrograms/ml Ampicillin. GeneScreen filters (New England Nuclear Catalog No. NEF-972) were pre-wet on a 
BHI — CAM plate (Bacto brain heart infusion 37 g/L, Casamino acids 2 g/L and agar 1 5 g/L, containing 500 micrograms/ 
ml Chloramphenicol) and were used to lift the colonies off the plate. The colonies were grown in the same medium for 
1 2 hours or longer to amplify the plasmid copy numbers. The amplified colonies (colony side up) were treated by serially 
placing the fitters over 2 pieces of Whatman 3 MM paper saturated with each of the following solutions: 

35 

(1) 50 mM glucose— 25 mM Tris-HCI (ph 8.0)— 10 mM EDTA (pH 8.0) for five minutes; 
* (2) 0.5 M NaOM for ten minutes; and 
(3) 1 .0 M Tris-HCI (pH 7.5) for three minutes. 

40 The filters were then air dried in a vacuum over at 80°C for two hours. 

The filters were then subjected to Proteinase K digestion through treatment with a solution containing 50 micro- 
grams/ml of the protease enzyme in Buffer K (0.1M Tris-HCI (pH 8.0)— 0.15M NaCI— 10 mM EDTA (pH 8.2)— 0.2% 
SDS]. Specifically, 5 ml of the solution was added to each filter arid the digestion was allowed to proceed at 55°C for 
30 minutes, after which the solution was removed. 

45 The filters were then treated wtih 4 ml of a prehybridization buffer (5 x SSPE — 0.5% SDS — 100 microgramms/ 

ml SS E. co// DNA — 5 X BFP). The prehybridization treatment was carried out at 55°C, generally for 4 hours or longer, 
after which the prehybridization buffer was removed. 

The hybridization process was carried out in the following manner. To each filter was added 3 ml of hybridization 
buffer (5 X SSPE — 0.5% SDS — 1 00 micrograms/ml yeast tRNA) containing 0.025 picomoles of each of the 1 28 probe 

so sequences of Table II (the total mixture being designated the EPV mixture) and the filters were maintained at 48°C for 
20 hours. This temperature was 2°C less than the lowest of the calculated dissociation temperatures (Td) determined 
for any of the probes. 

Following hybridization, the filters were washed three times for ten minutes on a shaker with 6 X SSC— 0.1 % SDS 
at room temperature and washed two to three times with 6 x SSC-1% SDS at the hybridization temperature (48°C). 
55 Autoradiography of the filters revealed seven positive clones among the 200,000 colonies screened. 

Initial sequence analysis of one of the putative monkey cDNA clones (designated clone 83) was performed for 
verification purposes by a modification of the procedure of Wallace, at al., Gene, 16, pp. 21—26 (1 981 ). Briefly, plasmid 
DNA from monkey cDNA clone 83 was linearized by digestion with EcoRI and denatured by heating in a boiling water 
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bath. The nucleotide sequence was determined by the dideoxy method of Sanger, at al., RN.A.S. (U.S.A.) 74, pp. 
5463 — 5467 (1 977). A subset of the EPV mixture of probes consisting of 1 6 sequences was used as a primer for the 
sequencing reactions. 

5 C. Monkey EPO cDNA Sequencing 

Nucleotide sequence analysis of clone 83 was carried out by the procedures of Messing, Methods in Enzymofogy 
101, pp. 20 — 78 (1983). Set out in Table IV is a preliminary restriction map analysis of the approximately 1600 base 
pair Ecofll/H/nc/lll cloned fragment of clone 83. Approximate locations of restriction endo-nuclease enzyme recognition 
10 sites are provided in terms of number of bases 3' to the EcoR site at the 5* end of the fragment. Nucleotide sequencing 
was carried out by sequencing individual restriction fragments with the intent of matching overlapping fragments. For 
example, an overlap of sequence information provided by analysis of nucleotides in a restriction fragment designated 
C113 (Sai/3A at ~111/Smal at -324) and the reverse order sequencing of a fragment designated C73 (AM at 
~424/Bsfai at -203). 

75 

TABLE IV 





Restriction Enzyme 


Recoanition Site 


rr laic uwi^auuil^o^ 


20 


r-_.ni 

fcCORI 


1 




Sau3A 


111 




Smal 


180 




BstEII 


203 


25 


Smal 


324 




Kpnl 


371 




Rsal 


372 




Alul 


424 




Pstl 


426 


30 


Atui 


HOW 




Hpal 


466 




Alul 


546 




Pstl 


601 


35 


Pvull 


604 


Alul 


605 




Alul 


782 




Alul 


788 




Rsal 


792 


40 


Pstl 


807 




Alul 


841 




Alul 


927 




Ncot 


946 


45 


Sau3A 


1014 


Alul 


1072 




Alul 


1115 




Alul 


. 1223 




Pstl 


1301 


50 


Rsal 


1343 




A{ul 


1384 




Hindlll 


1449 




Alul 


1450 




Hindlll 


1585 


55 





Sequencing of approximately 1342 base pairs (within the region spanning the Sau3A site 3' to the Ecorl site and 
the HindtU site) and analysis of all possible reading frames has allowed for the development of DNA and amino acid 
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sequence information set out in Table V. In the Table, the putative initial amino acid residue of the amino terminal of 
mature EPO (as verified by correlation to the previously mentioned sequence analysis of twenty amino terminal resi- 
dues) is designated by the numeral +1. The presence of a methionine-specifying ATG codon (designated -27) "up- 
stream") of the initial amino terminal alanine residue as the first residue designated for the amino acid sequence of 
5 the mature protein is indicative of the likelihood that EPO is initially expressed in the cytoplasm in a precursor form 
including a 27 amino acid "leader" region which is excised prior to entry of mature EPO into circulation. Potential 
glycosylation sites within the polypeptide are designated by asterisks. The estimated molecular weight of the translated 
region was determined to be 21 ,117 daltons and the M.W. of the 165 residues of the polypeptide constituting mature 
monkey EPO was determined to be 1 8,236 daltons. 

10 
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TABLE V 

Translation of Monkey EPO cDNA 



Htcccgcgccccctggacagccgccctctcctccaggcccgtgggcctggccctgccc 

CGCTGAACTTCCCGGGATGAGGACTCCCGGTGTGGTCACCGCGCGCCTAGGTCGCTGAG 

_27 -20 
Met Gly Val His Glu Cys Pro Ala Trp 
GGACCCCGGCCAGGCGCGGAGATG GGG GTG CAC GAA TGT CCT GCC TGG 

-10 

i eu Tro Leu Leu Leu Ser Leu Val Ser Leu Pro Leu Gly Leu Pro 
CJC TGG CTT CTC CTG TCT CTC GTG TCG CTC CCT CTG GGC CTG CCA 

-l +i 10 • ; 

val Pro Gly Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu 
GTC CCG GGC GCC CCA CCA CGC CTC ATC TGT GAC AGC CGA GTC CTG 

20 * 
pin Arn Tvr Leu Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met 
gIg JgG TAC CTC TTG GAG GCC AAG GAG GCC GAG AAT GTC ACG ATG 



30 



40 



riu rvs Ser Glu Ser Cys Ser Leu asn Glu Asn He Thr Val Pro 
GGC TGT TCC GaS AGC TGC AGC TTG AAT GAG AAT ATC ACC GTC CCA 



50' 

Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg Met Glu Val Gly 
GAC ACC AAA GTT AAC TTC TAT GCC TGG AAG AGG ATG GAG GTC GGG 

60 70 
rin Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu 
cJS CaS GC? GTA GAA GTC TGG CAG GGC CTG GCC CTG CTC TCA GAA 

80 * 

Ala Val Leu Arg Gly Gin Ala Val Leu Ala Asn Ser Ser Gin Pro 
GCT GTC CTG CGG GGC CAG GCC GTG TTG GCC AAC TCT TCC CAG CCT 



90 



100 



Phe Glu Pro Leu Gin Leu His Met Asp Lys Ala lie Ser Gly Leu 
TTC GAG CCC CTG CAG CTG CAC ATG GAT AAA GCC ATC AGT GGC CTT 

110 

Aro Ser lie Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Glu Ala 
CGC AGC ATC ACC ACT CTG CTT CGG GCG CTG GGA GCC CAC GAA GCC 
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TABLE V <conttnuted) 



lie Ser Leo Prp Asp Ala Ala s*»r ai» m « 130 

ATC TCC CTC CCA «f CCC & g-gf & £» J" & J£ -U 

140 

Thr Ala Asp Thr Phe Cys Lys Leu Ph» &m u«i t 

ACT OCT GAC ACT TTC TCC Aa'a CTc" "till G?C TflC % !?? ffi 



150 



8? a & us a &b ass is; & & &• %-a-a 

165 

Gly Asp Arg OP 

GGG GAC AGA TGA CCAGGTGCGTCCAGCTGGGCACATCCACCACCTCCCTCACCAACA 
CTGCCTGTGCCACACCCTCCCTCACCACTCCCGAACCCCATCGAGGGGCTCTCAGCTAAG 

CGCCAGCCTGTCCCATGGACACTCCAGTGCCAGCAATGACATCTCAGGGGCCAGAGGAAC 
TGTCCAGAGCACAACTCTGAGATCTAAGGATGTCGCAGGGCCAACTTGAGGGCCCAGAGC 
AGGAAGCATTCAGAGAGCAGCTTTAAACTCAGGAGCAGAGACAATGCAGGGAAAACACCT 
GAGCTCACTCGGCCACCTGCAAAATTTGATGCAGGACACGCTTTGGAGGCAATTTACCTG 
TTTTTGCACCTACCATCAGGGACAGGATGACTGGAGAACTTAGGTGGCAAGCTGTGACTT 
CTCAAGGCCTCACGGGCACTCCCTTGGTGGCAAGAGCCCCCTTGACACTGAGAGAATATT 
TTGCAATCTGCAGCAGGAAAAATTACGGACAGGTTTTGGAGGTTGGAGGGTACTTGACAG 

GTGTGTGGGGAAGCAGGGCGGTAGGGGTGGAGCTGGGATGGGAGTGAGAACCGTGAAGAC 

AGGATGGGGGCTGGCCTCTGGTTCTCGTGGGGTCCAAGCTT 

Hindlli 



105— ISPnsa^anH/nrrh,,,. o. j o- i. I PP 3824-3828 < 1981 ) and Kyte at al., J.Mol.Biol., 157, pp 
^ Chou.atal, aacfcm. 14 pp. 222-245(1 974) and Advances mEnzyn*>logy,47 PP 45-47 

PEP Reference Section 6.7 made avaibble by Intelligences, Inc., 124 University Avenue, Palo Alto Sornia 
Example 4 

A. Human Genomic Library 

18 £^£5^™ ^ atal Cell 

18, pp. 533-543 (1 979) was obta.ned and maintained for use in a plaque hybridization assay. ' 
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B. Plaque Hybridization Procedures For Screening Human Genomic Library 

Phage particles were lysed and the DNAs were fixed on filters (50,000 plaques per filter) according to the proce- 
dures of Woo, Methods In Enzymology, 68, pp. 389—395 (1979) except for the use of GeneScreen Plus filters (New 
England Nuclear Catalog No. NEF — 976) and NZYAM plates (NaCI, 5 g; MgCI 2 — 6H 2 0, 2 g; NZ-Amine A, 10 g; yeast 
extract, 5 g; casamino acids, 2 g; maltose; 2 g; and agar, 15 g per liter). 

The air-dried filters were baked at 80°C for 1 hour and then digested with Proteinase K as described in Example 
3, Part B. Prehybridization was carried out with a 1 M NaCI — 1 % SDS buffer for 55°C for 4 hours or more, after which 
the buffer was removed. Hybridization and post-hybridization washings were carried out as described in Example 3 
Part B. Both the mixture of 128 20-mer probes designated EPV and the mixture of 128 17-mer probes of Table III 
(designated the EPQ mixture) were employed. Hybridization was carried out at 48°C using the EPV probe mixture 
EPQ probe mixture hybridization was carried out at 46°C— 4 degrees below the lowest calculated Td for members of 
the mixture. Removal of the hybridized probe for rehybridization was accomplished by boiling with 1 X SSC — 0.1 % 
SDS for two minutes. Autoradiography of the filters revealed three positive clones (reactive with both probe mixtures) 
among the 1 ,500,000 phage plaques screened. Verification of the positive clones as being EPO-encoding was obtained 
through DNA sequencing and electron micrographic visualization of heteroduplex formation with the monkey cDNA of 
Example 3. This procedure also gave evidence of multiple introns in the genomic DNA sequence. 

Examples 

Nucleotide sequence analysis of one of the positive clones (designated AhEl) was carried out and results obtained 
to date are set out in Table VI. 
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In Table VI, the initial continuous DNA sequence designates a top strand of 620 bases in what is apparently an 
untranslated sequence immediately preceding a translated portion of the human EPO gene. More specifically, the 
sequence appears to comprise the 5' end of the gene which leads up to a translated DNA region coding for the first 
four amino acids (-27 through -24) of a leader sequence ("presequence"). Four base pairs in the sequence prior to that 
encoding the beginning of the leader have not yet been unambiguously determined and are therefore designated by 
an "X". There then follows an intron of about 639 base pairs (439 base pairs of which have been sequenced and the 
remaining 200 base pairs of which are designated "I.S.") and immediately preceding a codon for glutamine which has 
been designated as residue -23 of the translated polypeptide. The exon sequence immediately following is seen to 
code for amino acid residues through an alanine residue (designated as the +1 residue of the amino acid sequence 
of mature human EPO) to the codon specifying threonine at position +26, whereupon there follows a second intron 
consisting of 256 bases as specifically designated. Following this intron is an exon sequence for amino acid residues 
27 through 55 and thereafter a third intron comprising 612 base pairs commences. The subsequent exon codes for 
residues 56 through 1 1 5 of human EPO and there then commences a fourth intron of 1 34 bases as specified. Following 
the fourth intron is an exon coding for residue Nos. 116 through 1 66 and a "stop" codon (TGA). Finally, Table VI identifies 



24 



EP 0148 605 B2 



a sequence of 568 base pairs in what appears to be an untranslated 3' region of the human EPO gene, two base pairs 
of which ("X") have not yet been unambiguously sequenced. 

Table VI thus serves to identify the primary structural conformation (amino acid sequence) of mature human EPO 
as including 166 specified amino acid residues (estimated M.W. = 18,399). Also revealed in the Table is the DNA 

s sequence coding for a 27 residue leader sequence along with 5' and 3' DNA sequences which may be significant to 
promoter/operator functions of the human gene operon. Sites for potential glycosylation of the mature human EPO 
polypeptide are designated in the Table by asterisks. It is worthy of note that the specific amino acid sequence of Table 
VI likely constitutes that of a naturally occurring allelic form of human erythropoietin. Support for this position is found 
in the results of continued efforts at sequencing of urinary isolates of human erythropoietin which provided the finding 

10 that a significant number of erythropoietin molecules therein have a methionine at residue 1 26 as opposed to a serine 
as shown in the Table. 

Table VII, below, illustrates the extent of polypeptide sequence homology between human and monkey EPO. In 
the upper continuous line of the Table, single letter designations are employed to represent the deduced translated 
polypeptide sequences of human EPO commencing with residue -27 and the lower continuous line shows the deduced 

'5 polypeptide sequence of monkey EPO commencing at assigned residue number -27. Asterisks are employed to high- 
light the sequence homologies. It should be noted that the deduced human and monkey EPO sequences reveal an 
"additional" lysine (K) residue at (human) position 116. Cross-reference to Table VI indicates that this residue is at the 
margin of a putative mRNA splice junction in the genomic sequence. Presence of the lysine residue in the human 
polypeptide sequence was further verified by sequencing of a cDNA human sequence clone prepared from mRNA 

20 isolated from COSP— 1 cells transfored with the human genomic DNA in Example 7, infra. 
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Example 6 



The expression system selected for initial attempts at microbial synthesis of isbiatable quantities of EPO polypep- 
tide material coded for by the monkey cDNA provided by the procedures of Example 3 was one involving mammalian 
55 host cells (i.e., COS — 1 cells, A.T.C.C. No. CRL— 1650). The cells were transfected with a "shuttle" vector capable of 
autonomous replication in E.coli host (by virtue of the presence of pBR322-derived DNA) and the mammalian hosts 
(by virtue of the presence of SV40 virus-derived DNA). 

More specificalty, an expression vector was constructed according to the following procedures. The plasmid clone 
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83 provided in Example 3 was amplified in Eatf and the approximately 1 .4 kb monkey EPO-encoding DNA was isolated 
by EcoRI and HindM digestion. Separately isolated was an approximately 4.0 kb. Hind\\/SaH fragment from pBR322 
An approximately 30 bp, EcoR/SaH "linker" fragment was obtained from Mf3mp10 RF DNA (P and L Laboratories) 
This linker included, in series, an EcoRI sticky end, followed by Ss&.SmaX, BamH\ and Xbal recognition sites and a 
Sal sticky end. The above three fragments were ligated to provide an approximately 5.4 kb intermediate plasmid 
( pERS") wherein the EPO DNA was flanked on one side by a "bank" of useful restriction endonuclease recognition 
sites. pERS was then digested with HindU and Sa< to yield the EPO DNA and the EcoRI to San (M13mp10) linker. 
The 1 .4 kb fragment was ligated with an approximately 4.0 kb SamHI/SaH of pBR322 and another M13mpi0 Hind\\\/ 
BarrHl RF fragment linker also having approximately 30 bp. The M13 linker fragment was characterized by a HindU 
sticky end. followed by Pstt, SaA, Xoal recognition sites and a Baml sticky end. The ligation product was. again a 
useful intermediate plasmid ("pBR-EPO") including the EPO DNA flanked on both sides by banks of restriction site 
The vector chosen for expression of the EPO DNA in COS-1 cells ("pDSVLI •) had previously been constructed 
to allow for selection and autonomous replication in E.coli. These characteristics are provided by the origin of replication 
and Ampicillin resistance gene DNA sequences present in the region spanning nucleotides 2448 through 4362 of 
PBR322. This sequence was structurally modified by the addition of a linker providing a HindW recognition immediately 
adjacent nucleotide 2448 prior to incorporation into the vector. Among the selected vector's other useful properties 
was the capacity to autonomously replicate in COSM cells and the presence of a viral promoter sequence functional 
in mammalian cells. These characteristics are provided by the origin of replication DNA sequence and "late gene" viral 
promoter DNA sequence present in the 342 bp sequence spanning nucleotide numbers 5171 through 270 of the SV40 
genome. A unique restriction site (BamHi; was provided jn the vectQr and immediately adjacent the viral promoter 
sequence through use of a commercially available linker sequence (Collaborative Research). Also incorporated in the 
vector was a 237 base pair sequence (derived as nucleotide numbers 2553 through 2770 of SV40) containing the "late 
gene" viral mRNA polyadenylation signal (commonly referred to as a transcription terminator). This fragment was po- 
sitioned in the vector in the proper orientation vis-a-vis the "late gene" viral promoter via the unique BamHI site Also 
present in the vector was another mammalian gene at a location not material to potential transcription of a gene inserted 
at the unique SamHI site, between the viral promoter and terminator sequences. [The mammalian gene comprised an 
approximately 2.500 bp mouse dihydrofolate reductase (DHFR) minigene isolated from plasmid pMG-1 as in Gasser 
at aL. P.N. AS. (US A,, 79, pp. 6522-6526, (1982).] Again, the major operative components of plasmid pDSVLI 
comprise nucleotides 2448 through 4362 of pBR322 along with nucleotides 5171 through 270 (342 bp) and 2553 
30 through 2770 (237 bp) of SV40 DNA. . 

Following procedures described, e.g., in Maniatis. at al.. supra, the EPO-encoding DNA was isolated from plasmid 
pBR-EPO as a BamHI fragment and ligated into plasmid pDSVLI cut with BamHI. Restriction enzyme analysis was 
employed to confirm insertion of the EPO gene in the correct orientation in two of the resulting cloned vectors (duplicate 
vectors H and L). See Figure 2. illustrating plasmid pDSVL — MkE. Vectors with EPO genes in the wrong orientation 
were saved for use as negative controls in transfection experiments designed to determine EPO expression levels in 
hosts transformed with vectors having EPO DNA in the correct orientation. 

Vectors H, L. F. X and G were combined with carrier DNA (mouse liver and spleen DNA) were employed to transf ect 
duplicate 60 mm plates by calcium phosphate microprecipitate methods. Duplicate 60 mm plates were also transfected 
with carrier DNA as a "mock" transformation negative control. After five days all culture media were tested for the 
presence of polypeptides possessing the immunological properties of naturally-occurring EPO. 

Example 7 

A. Initial EPO Expression System Involving COS— 1 Cells 

The system selected for initial attempts at microbial synthesis of isolatable quantities of human EPO polypeptide 
materia coded for by the human genomic DNA EPO clone, also involved expression in mammalian host cells (ie 
COS-1 cells, A.T.C.C. No. CRL-1650). The human EPO gene was first sub-cloned into a "shuttle" vector which is 
capable of autonomous replication in both E.coli hosts (by virtue of the presence of pBR322 derived DNA) and in the 
mammalian cell line COS— 1 (by virtue of the presence of SV40 virus derived DNA). The shuttle vector, containing the 
EPO gene, was then transfected into COS-1 cells. EPO polypeptide material was produced in the transfected cells 
and secreted into the cell culture media. 

More specifically, an expression vector was constructed according to the following procedures. DNA isolated from 
lambda clone XhE1. containing the human genomic EPO gene, was digested with BamHI and HindW restriction endo- 
nuclease.s, and a 5.6 Kb DNA fragment known to contain the entire EPO gene was isolated. This fragment was mixed 
and ligated with the bacterial plasmid pUC8 (Bethesda Research Laboratories, Inc.) which had been similarly digested 
creating the intermediate plasmid "pUC8-HuE", providing a convenient source of this restriction fragment 

The vector chosen for expression of the EPO DNA in COS-1 cells (pSV4SEt) had previously been constructed 
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PlasmkJ pSV4SEt contained DNA sequences allowing selection and autonomous replication in E.coli These charac- 
teristics are provided by the origin of replication and Ampicillin resistance gene DNA sequences present in the reqion 
spanning nucleotides 2448 through 4362 of the bacterial plasmid pBR322. This sequence was structurally modified 
by the addition of a linker providing a HindW recognition site immediately adjacent to nucleotide 2448 Plasmid pSV4SEt 
was also capable of autonomous replication in COS— 1 cells. This characteristic was provided by a 342 bp fragment 
containing the the SV40 virus origin of replication (nucleotide numbers 5171 through 270). This fragment has been 
modified by the addition of a linker providing an EcoRI recognition site adjacent to nucleotide 270 and a linker providing 
a Sa/I recognition site adjacent nucleotide 51 71 . A 1061 bp fragment of SV40 was also present in this vector (nucleotide 
numbers 1 711 through 2772 plus a linker providing a Sat recognition site next to nucleotide number 2772) Within this 
fragment was an unique BamH\ recognition sequence. In summary, plasmid pSV4SEt contained unique BamHI and 
Hind i recognrtion sites, allowing insertion of the human EPO gene, sequences allowing replication and selection in 
E.coli, and sequences allowing replication in COS— 1 cells. 

In order to insert the EPO gene into pSV4SEt, plasmid pUC8-HuE was digested with BamHI and HindW restriction 
endonucleases and the 5.6 kb EPO encoding DNA fragment isolated, pSV4SEt was also digested with BamHI and 
HindW and the major 2513 bp fragment isolated (preserving all necessary functions). These fragments were mixed 
and I (.gated, creating the final vector -pSVgHuEPO". (See, Figure 3.) This vector was propagated in Eco/Zand vector 
isolated. Restriction enzyme analysis was employed to confirm insertion of the EPO gene 
Plasmid pSVgHuEPO DNA was used to express human EPO polypeptide material in COS— 1 cells More specif- 
ically, pSVgHuEPO DNA was combined with carrier DNA and transfected into triplicate 60 mm plates of COSP-1 cells 
As a control, carrier DNA alone was also transfected into COS— 1 cells. Cell culture media were sampled five and 
seven days later and tested for the presence of polypeptides possessing the immunological properties of naturally 
occurring human EPO. y 

B. Second EPO Expression System Involving COS— 1 Cells 

Still another system was designed to provide improved production of human EPO polypeptide material coded bv 
the human genomic DNA EPO clone in COS— 1 cells (A.T.C.C. No. CRL— 1650) 

,h* lT^T e ^ W ^f m9 SyS,em ' EP ° W3S ex P fessed in cos - 1 "Us using its own promoter which is within 
the 5.6 Kb BamHI to Htndll restriction fragment. In the following construction, the EPO gene is altered to that it is 
30 expressed using the SV40 late promoter. 

More specifically, the cloned 5.6 Kb BamHI to HindW genomic human EPO restriction fragment was modified by 
he following procedures. Plasmid pUCB-HuE, as described above, was cleaved with BamHI and with BstB\ restric- 
tion endonucleases. BstBt cleaves wrthin the 5.6 Kb EPO gene at a position which is 44 base pairs 5' to the initiating 
?I£ C u 9 Pre-Peptide and approximately 680 base pairs 3 1 to the Hindu restriction site. The approximately 
4900 base pair fragment was isolated. A synthetic linker DNA fragment, containing Sail and Bsffll sticky ends and an 
To^oo recoan,,lon si,e w as synthesized and purified. The two fragments were mixed and ligated with plasmid 
PBR322 which had been cut with Sail and BamHI to produce the intermediate plasmid pBRgHE. The genomic human 
EPO gene can be isolated therefrom as a 4900 base pair BamHI digestion fragment carrying the complete structural 
gene with a single ATG 44 base pairs 3' to BamHI site adjacent the amino terminal coding region 
n J«'f L ra9me u nt was isolated and inserted as a Bam H\ fragment into BamHI cleaved expression vector plasmid 
pDSVLI (described in Example 6). The resulting plasmid, pSLVgHuEPO, as illustrated in Figure 4, was used to express 
EPO polypeptide material from COS— 1 cells, as described in Examples 6 and 7A. 

Example 8 

Culture media from growth of the six transfected COS— 1 cultures of Example 6 were analyzed by radioimmu- 
noassay according to the procedures set forth in Example 2. Part B. Each sample was assayed at 250 125 50 and 
25 microliter ahquot levels. Supernatants from growth of cells mock transfected or transfected with vectors having 
incorrect EPO gene orientation were unambiguously negative for EPO immunoreactivity. For each sample of the two 
supernatants derived from growth of COS— 1 cells transfected with vectors (H and L) having the EPO DNA in the 
correct orientation, the % inhibition of ^l-EPO binding t0 antibody ranged (rom 72 , Q 88% whjch p|aces a|| vg|ues 
at the top of the standard curve. The exact concentration of EPO in the culture supernatant could not then reliably be 
estimated. A quite conservative estimate of 300 mU/ml was made, however, from the value calculation of the largest 
aliquot size (250 microliter). »me«>i 

A representative culture fluid according to Example 6 and five and seven day culture fluids obtained according to 
Example 7A were tested in the RIA in order to compare activity of recombinant monkey and human EPO materials to 
a naturally-occurring human EPO standard and the results are set out in graphic form in Figure-1. Briefly the results 
expectedly revealed that the recombinant monkey EPO significantly competed for anti-human EPO antibody although 
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it was not able to completely inhibit binding under the test conditions. The maximum percent inhibition values for re- 
combinant human EPO, however, closely approximated those of the human EPO standard. The parallel nature of the 
dose response curves suggest immunological identity of the sequences (epitopes) in common. Prior estimates of mon- 
key EPO in culture fluids were re-evaluated at these higher dilution levels and were found to range from 2.91 to 3.12 
s U/ml. Estimated human EPO production levels were correspondingly set at 302 mU/ml for the five-day growth sample 
and 567 mU/ml for the seven day growth sample. Estimated monkey EPO production levels in the Example 78 expres- 
sion system were on the same order or better. 

Example 9 

10 

Culture fluids prepared according to Examples 6 and 7 were subjected to an in vitro assay for EPO activity according 
to the procedure of Goldwasser, at al., Endocrinology, 97, 2, pp. 31 5 — 323 (1 975). Estimated monkey EPO values for 
culture fluids tested ranged from 3.2 to 4.3 U/ml. Human EPO culture fluids were also active in this in vitro assay and, 
further, this activity could be neutralized by anti-EPO antibody. The recombinant monkey EPO culture fluids according 
'5 to Example 6 were also subjected to an assay for in vivo biological activity according to the general procedures of 
Cotes, atal., Nature, 191, pp. 1065— 1067 (1961) and Hammond, at al., Ann.N. Y.Acad.Sci, 149, pp. 516—527 (1968) 
and activity levels ranged from 0.94 to 1 .24 U/ml. 

Example 10 

20 

In the previous examples, recombinant monkey or human EPO material was produced from vectors used to trans- 
feet COS — 1 cells. These vectors replicate in COS — 1 cells due to the presence of SV40 T antigen within the cell and 
an SV40 origin of replication on the vectors. Though these vectors produce useful quantities of EPO in COS — 1 cells, 
expression is only transient (7 to 14 days) due to the eventual loss of the vector. Additionally, only a small percentage 

25 of COS — 1 became productively transfected with the vectors. The present example describes expression systems 
employing Chinese hamster ovary (CHO) DHFR- cells and the selectable marker, DHFR. [For discussion of related 
expression systems, see U.S. Letters Patent No. 4,399,216 and European Patent Applications 117058, 117059 and 
1 1 7060, all published August 29, 1 984.] 

CHO DHFR- cells (DuX— B11) CHD K1 cells, Urlaub, at al., Proc. Nat Acad. Sci. (U.S.A.), Vol. 77, 4461 (1980) 

30 lack the enzyme dihydrofolate reductase (DHFR) due to mutations in the structural genes and therefore require the 
presence of glycine, hypoxanthine, and thymidine in the culture media; Plasmids pDSVL-MkE (Example 6) or pDSVL- 
gHuEPO (Example 7B) were transfected along with carrier DNA into CHO DHFR- cells growing in media containing 
hypoxanthine, thymidine, and glycine in 60 mm culture plates. Plasmid pSVgHuEPO (Example 7A) was mixed with 
the plasmid pMG2 containing a mouse dihydrofolate reductase gene cloned into the bacterial plasmid vector pBR322 

35 . (per Gasser, at al., supra.) The plasmid mixture and carrier DNA was transfected into CHO DHFR cells. (Cells which 
acquire one plasmid will generally also acquire a second plasmid). After three days, the cells were dispersed by trypsini- 
zation into several 100 mm culture plates in media lacking hypoxanthine and thymidine. Only those cells which have 
been stably transformed with the DHFR gene, and thereby the EPO gene, survive in this media. After 7—21 days, 
colonies of surviving cells became apparent. These transformant colonies, after dispersion by trypsinization can be 

40 continuously propagated in media lacking hypoxanthine and thymidine, creating new cell strains (e.g., CHO pDSV- t 
L — MkEPO, CHO pSVgHuEPO, CHO — pDSVL — gHuEPO). 

Culture fluids from the above cell strains were tested in the RIAfor the presence of recombinant monkey or human 
EPO. Media for strain CHO pDSVL — MkEPO contained EPO with immunological properties like that obtained from 
COS — 1 cells transfected with plasmid pDSVL— MkEPO. A representative 65 hour culture fluid contained monkey 

45 EPO at 0.60 U/ml. 

Culture fluids from CHO pSVgHuEPO and CHO pDSVL-gHuEPO contained recombinant human EPO with immu- 
nological properties like that obtained with COS — 1 cells transfected with plasmid pSVgHuEPO or pDSVL — gHuEPO. 
A representative 3 day culture fluid from CHO pSVgHuEPO contained 2.99 U/ml of human EPO and a 5.5 day sample 
from CHO pDSVL— gHuEPO had 18.2 U/ml of human EPO as measured by the RIA. 

50 The quantity of EPO produced by the cell strains described above can be increased by gene amplification giving 

new cell strains of greater productivity. The enzyme dihydrofolate reductase (DHFR) which is the product coded for by 
the DHFR gene can be inhibited by the drug methotrexate (MTX). More specifically, cells propagated in media lacking 
hypoxanthine and thymidine are inhibited or killed by MTX. Under the appropriate conditions, (e.g., minimal concen- 
trations of MTX) cells resistant to and able to grown in MTX can be obtained. These cells are found to be resistant to 

55 MTX due to an amplification of the number of their DHFR genes, resulting in increased production of DHFR enzyme. 
The surviving cells can, in turn, be treated with increasing concentrations of MTX, resulting in cell strains containing 
greater numbers of DHFR genes. "Passenger genes" (e.g., EPO) carried on the expression vector along with the DHFR 
gene or transformed with the DHFR gene are frequently found also to be increased in their gene copy number. 
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As examples of practice of this amplification system, cell strain CHO pDSVL — MkE was subjected to increasing 
MTX concentrations (0 nM, 30 nM and 100 nM). Representative 65-hour culture media samples from each amplification 
step were assayed by Rl A and determined to contain 0.60, 2.45 and 6. 1 0 U/hnl, respectively. Cell strain CHO pDSV- 
L— gHuEPO was subjected to a series of increasing MTX concentrations of 30 nM, 50 nM, 100 nM, 200 nM, 1 jiM, 

5 and 5 u.M MTX. A representative 3-day culture media sample from the 100 nM MTX step contained human EPO at 
3089 ± 129 u/mtas judged by RIA. Representative 48' hour cultural medium samples from the 100 nM and 1 |iM MTX 
steps contained, respectively, human EPO at 466 and 1352 U/ml as judged by RIA (average of triplicate assays). In 
these procedures, 1 X 10 6 cells were plated in 5 ml of media in 60 mm culture dishes. Twenty-four hours later the 
media were removed and replaced with 5 ml of serum-free media (high glucose DMEM supplemented with 0.1 mM 

10 non-essential amino acids and L-glutamine). EPO was allowed to accumulate for 48 hours in the serum-free media. 
The media was collected for RIA assay and the cells were trypsinized and counted. The average RIA values of 467 
U/ml and 1 352 U/ml for cells grown at 100 nM and 1 uJvl MTX, respectively, provided actual yields of 2335 U/plate and 
6750 U/plate. The average cell numbers per plate were 1.94 x 10 6 and 3.12 X 10 6 cells, respectively. The effective 
production rates for these culture conditions were thus 1264 and 2167 U/10 6 cells/48 hours. 

is The cells in the cultures described immediately above are a genetically heterogeneous population. Standard 

screening procedures are being empbyed in an attempt to isolate genetically homogeneous clones with the highest 
production capacity. See, Section A, Part 2, of "Points to Consider in the Characterization of Cell Lines Used to Produce 
Biologies', June 1, 1984, Office of Biologies Research Review, Center for Drugs and Biologies, U.S. Food and Drug 
Administration. 

20 The productivity of the EPO producing CHO cell lines described above can be improved by appropriate cell culture 

techniques. The propagation of mammalian cells in culture generally requires the presence of serum in the growth 
media. A method for production of erythropoietin from CHO cells in media that does not contain serum greatly facilitates 
the purification of erythropoietin from the culture medium. The method described below is capable of economically 
producing erythropoietin in serum-free media in large quantities sufficient for production. 

25 Strain CHO pDSVL— gHuEPO cells, grown in standard cell culture conditions, are used to seed spinner cell culture 

flasks. The cells are propagated as a suspension cell line in the spinner cell culture flask in media consisting of a 
50—50 mixture of high glucose DMEM and Ham's F1 2 supplemented with 5% fetal calf serum, L-glutamine, Penicillin 
and Streptomycin, 0.05 nM non-essential amino acids and the appropriate concentration of methotrexate. Suspension 
cell culture allows the EPO-producing CHO cells to be expanded easily to large volumes. CHO cells, grown in sus- 

30 pension, are used to seed roller bottles at an initial seeding density of 1 .5 X 10 7 viable cells per 850 cm 2 roller bottle 
in 200 ml of media. The cells are allowed to grow to confluency as an adherent cell line over a three-day period. The 
media used for this phase of the growth is the same as used for growth in suspension. At the end of the three-day 
growth period, the seriim containing media is removed and replaced with 100 ml of serum-free media; 50 — 50 mixture 
of high glucose DMEM and Ham's F12 supplemented with 0.05 mM non-essential amino acids and L-glutamine. The 

35 roller bottles are returned to the roller bottle incubator for a period of 1-^-3 hours and the media again is removed and 
replaced with 100 ml of fresh serum-free media. The 1 — 3 hour incubation of the serum-free media reduces the con- 
centration of contaminating serum proteins. The roller bottles are returned to the incubator for seven days during which 
erythropoietin accumulates in the serum-free culture media. At the end of the seven-day production phase, the condi- 
tioned media is removed and replaced with fresh serum-free medium for a second production cycle. As an example 

^0 of the practice of this production system, a representative seven-day, serum-free media sample contained human 
erythropoietin at 3892 ± 409 Ulml as judged by the RI A Based on an estimated cell density of 0.9 to 1 .8 x 1 0 5 cells/ 
cm 2 , each 850 cm 2 roller bottle contained from 0.75 to 1 .5 x 10 8 cells and thus the rate of production of EPO in the 
7-day, 1 00 ml culture was 750 to 1 470 H/1 0 6 cells/48 hours. 

Culture fluids from cell strain CHO pDSVL— MkEPO carried in 10 nM MTX were subjected to RIA in vitro and in 

45 V i V o EPO activity assays. The conditioned media sample container 41.2 ± 1.4 U/ml of MkEPO as measured by the 
RIA, 41 .2 ± 0.064 U/ml as measured by the in vitro biological activity assay and 42.5 ± 5 U/ ml as measured by the in 
vivo biological activity assay. Amino acid sequencing of polypeptide products revealed the presence of EPO products, 
a principle species having 3 residues of the "leader" sequence adjacent the putative amino terminal alanine. Whether 
this is the result of incorrect membrane processing of the polypeptide in CHO cells or reflects a difference in structure 

50 of the amino terminus of monkey EPO vis-a-vis human EPO, is presently unknown. 

Culture fluids from cell strain CHO pDSVL— gHuEPO were subjected to the three assays. A 5.5 day sample con- 
tained recombinant human EPO in the media at a level of 18.2 U/ml by RIA assay, 15.8 ± 4.6 U/ ml by in vitro assay 
and 16.8 + 3.0 U/ml by in vivo assay. 

Culture fluid from CHO pDSVL-gHuEPO cells prepared amplified by stepwise 100 nM MTX were subjected to the 

55 three assays. A 3.0 day sample contained recombinant human EPO at a level of 3089 ± 1 29 U/ml by RIA, 2589 ±71.5 
U/ml by in vitro assay, and 2040 ± 160 U/ml by in vivo assay. Amino acid sequencing of this product reveals an amino 
terminal corresponding to that designated in Table VI. 

Cell conditioned media from CHO cells transfected with plasmid pDSVL — MkE in 10 nM MTX were pooled, and 
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the MTX dialyzed out over several days, resulting In media with an EPO activity of 221 ±5.1 U/ ml (EPO-— CCM). To 
determine the in vivo effect of the EPO— CCM upon hematocrit levels in normal Balb/C mice, the following experiment 
was conducted. Cell conditioned media from untransfected CHO cells (CCM) and EPO — CCM were adjusted with 
PBS. CCM was used for the control group (3 mice) and two dose levels of EPO— CCM — 4 units per injection and 44 
units per injection — were employed for the experimental groups (2 mice/group). Over the course of 5 weeks, the 
seven mice were injected intraperitoneally, 3 times per week. After the eighth injection, average hematocrit values for 
the control group were determined to be 50.4%; for the 4U group, 55. 1%; and, for the 44U group, 67.9%. 

Mammalian cell expression products may be readily recovered in substantially purified form from culture media 
using HPLC (C 4 ) employing an ethanol gradient, preferably at pH7. 

A preliminary attempt was made to characterize recombinant glycoprotein products from conditioned medium of 
COS— 1 and CHO cell expression of the human EPO gene in comparison to human urinary EPO isolates using both 
Western blot analysis and SDS— PAGE. These studies indicated that the CHO-produced EPO material had a some- 
what higher molecular weight than the COS— 1 expression product which, in turn, was slightly larger than the cooled 
source human urinary extract. All products were somewhat heterogeneous. Neuraminidase enzyme treatment to re- 
move sialic acid resulted in COS— 1 and CHO recombinant products of approximately equal molecular weight which 
were both nonetheless larger than the resulting asialo human urinary extract. Endoglycosidase F enzyme (EC 3.2.1) 
treatment of the recombinant CHO product and the urinary extract product to totally remove carbohydrate from both) 
resulted in substantially homogeneous products having essentially identical molecular weight characteristics. 

Glycoprotein products provided by the present invention are thus comprehensive of products having a primary 
structural conformation sufficiently duplicative of that of a naturally-occurring erythropoietin to allow possession of one 
or more of the biological properties thereof and having an average carbohydrate composition which differs from that 
of naturally-occurring erythropoietin. 

i 

Example 11 

The present example relates to the total manufacture by assembly of nucleotide bases of two structural genes 
encoding the human species EPO sequence of Table VI and incorporating, respectively "preferred 1 codons for expres- 
sion in Ecoli and yeast (S.cerevisiae) cells. Also described is the construction of genes encoding analogs of human 
EPO. Briefly stated, the protocol employed was generally as set out in the previously noted disclosure of Alton, et al. 
(WO 83/04053). The genes were designed for initial assembly of component oligonucleotides into multiple duplexes 
which, in turn.were assembled into three discrete sections. These sections were designed for ready amplification and, 
upon removal from the amplification system, could be assembled sequentially or through a multiple fragment ligation 
in a suitable expression vector. 

Tables VIII through XIV below illustrate the design and assembly of a manufactured gene encoding a human EPO 
translation product lacking any leader or presequence but including an initial methionine residue at position -1. More- 
over, the gene incorporated in substantial part E coli preference codons and the construction was therefore referred 
to as the "ECEPO' gene. 
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TABLE VIII 
ECEPO SECTION 1 OLIGONUCLEOTIDES 

1. AATTCTAGAAACCATGAGGGTAATAAAATA 

2. CCATTATTTTATTACCCTCATGGTTTCTAG 

3. ATGGCTCCGCCGCGTCTGATCTGCGAC 

4. CTCGAGTCGCAGATCAGACGCGGCGGAG 

5. TCGAGAGTTCTGGAACGTTACCTGCTG 

6. CTTCCAGCAGGTAACGTTCCAGAACT 

7. GAAGCTAAAGAAGCTGAAAACATC 
8; GTGGTGATGTTTTCAGCTTCTTTAG 
9. ACCACTGGTTGTGCTGAACACTGTTC 

10. CAAAGAACAGTGTTCAGCACAACCA 

11. TTTGAACGAAAACATTACGGTACCG 

12. GATCCGGTACCGTAATGTTTTCGTT 

TABLE IX 
EC EPO SECTION 1 

Xba l 

EcoRI 1 3 

AATTCTAG AAACCATGAG GGTAATAAAA TAWJGGCTCC GCCGCGTCTG 
GATC TTTGGTACTC CCATTATTTT ATTACCTgaGG CGGCGCAGAC 
1 4 

ATCTGCGAc k CGAGA GTTCT GGAACGTTAC CTGCTc lsAAG CTAAAGAAGC 
TAGACGCTGA GCTCjTCAAGA CCTTGCAATG GACGACCTTc] GATTTCTTCG 

- I ?- ,11 

TGAAAACATC jACCACTGGTT GTGCTGAACA CTCTTc |rTTC AACGAAAACA 

ACTTTTGTAG TGGTGpCCAA CACGACTTGT GACAAGAAAC] TTGCTTTTGT 
8 10 1 

Kpn l BamHI 
TTACGGTACC G 
AATGCC ATGG CCTAG 
12 
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TABLE X 


ECEPO 


SECTION 2 OLIGONUCLEOTIDES 


I 

x • 


-at TPPPTflpp ap ap ami Apr t 




ptt a app ttpptptp tpptappp 


J • 


T a ap XX r t appp ttpp a a appt a t 


A 

*♦ • 


1 T CUM I ALG i T 1 UL AAGCG T AG A A 


C 


nriiflPT tpptp a ap a apt apttp a att 


6 • 


CCAAACTTCAACTGCTTGTTGACCAAC 


7 • 


TTGGCAGGGTCTGGCACTGCTGAGCG 


0 • 


GCCTCGCTCAGCAGTGCCAGACCCTG 




AMAMVMY MM MM M VMM M M » M M » A 

AGGCTGTACTGCGTGGCCAGGCA 


1 M. 

10. 


GCAGTGCCTGGCCACGCAGTACA 


11 . 


CTGCTGGTAAACTCCTCTCAGCCGT 


12. 


TTCCCACGGCTGAGAGGAGTTTACCA 


13 . 


MM M A A M M MM V* M M A M M V MM A V M V V M it M 

GGGAACCGCTGCAGCTGCATGTTGAC 


14. 


GCTTTGTCAACATGCAGCTGCAGCGG 


15. 


AAAGCAGTATCTGGCCTGAGATCTG 


16. 


GATCCAGATCTCAGGCCAGATACT 
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TABLE XI 



ECEPO SECTION 2 

EcoRl Kpn l 1 3 

A ATTCGGTACC AGACACCAAG GTlTAACTTCT ACGCTTGGAA ACGTATL 

GCCATGG TCTGTGGTTC CAATT^HAGA TGCGAACCTT TGCATACCTT] 



r bcAA 

HCCTF| 



I , 2 

GTTGGTCAAC AAGCAGTTGA AGT pTGGC AG GGTCTGGCAC TGCTGAGCGfc 

CAACCAGTTG TTCGTCAACT TCAAACCfcTC CCAGACCGTG ACGACTCGCT 
6 r 8 * 

GGCTGTACTG CGTGGCCAGG CAjCTGCTGCT AAACTCCTCT CAGCCGTCGG 

CCGhCATGAC GCACCGGTCC GTGACGllCCA TTTGAGGAGA GTCGGCACCC 
1 in I ii 



13 

AACCGCTGCA 



GCTGCATGTT GACHAAGCAG 



12 

15 

TATCTGCCCT 



Ball I BamHI 
GAGATCTG 



TffcGCGACGT CGACGTACAA CTGTTTCGITC ATAGACCGGA CTCTAGACCTAC 
r 14 1 16 
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TABLE XII 
ECEPO SECTION 3 

1. GATCCAGATCTCTGACTACTCTGC 

2. ACGCAGCAGAGTAGTCAGAGATCTG 

3 . TGCGTGCTCTGGGTGCACAGAAAGAGG 
A. GATAGCCTCTTTCTGTGCACCCAGAGC 

5. CTATCTCTCCGCCGGATGCTGCATCT 

6. CAGCAGATGCAGCATCCGGCGGAGA 

7. GCTGCACCGCTGCGTACCATCACTG 

8. ATCAGCAGTGATGGTACGCAGCGGTG 

9. CTGATACCTTCCGCAAACTGTTTCG 

10. ATACACGAAACAGTTTGCGGAAGGT 

11 . TGTATACTCTAACTTCCTGCGTGGTA 

12. CAGTTTACCACGCAGGAAGTTAGAGT 

13 . AACTGAAACTGTATACTGGCGAAGC 

14. GGCATGCTTCGCCAGTATACAGTTT 

15. ATGCCGTACTGGTGACCGCTAATAG 

16. TCGACTATTAGCGGTCACCAGTAC 
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FA8LS XIII 



ECEPO SECTION 3 



Bamm agi ir 

GA"TCCAGArCTCTG 
GTCMGAGAC 



ACTACTCTGC (rGCGTGCTCT GGGTGCACAG AAAGAGc btA TC TCTCCGCC 
TGATGAGACG ACGCA|ZGAGA CCCACGTGTC TTTCTCCGAT~AG]AGAGGCGG 



GGATGCTGCA TCThcjGCAC CGCTGCGTAC CATCACT Gbt GAT ACCTTCC 
CCTACGACGT AGACGAC^TG GCGACGCATG GTAGTGACGA CTfJTGGAAGG 
& 8 



GCAAACTGTT TCG |TGTATA C TCTAACTTCC TGCGTGGTaJaaCTGAAACTG 
CGTTTGACAA AGCACATAITG AGATTGAAGG ACGCACCATT~TGAaTTTGAC 
10 12 ^ 

■ 15 Sail 

TATACTGGCG AACC ftTCCC G TACTGGTGAC CGCTAATAG 
ATATGACCGC TTCGTACGGjC ATGACCACTG GCGATTATC AGCT 



4 



7 



9 
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TABLE XIV 
ECEPO GENE 

-11 

Xbal MetAla 
CTAG AAACCATGAG GGTAATAAAA TAATGGCTCC GCCGCGTCTG 
TTTGGTACTC CCATTATTTT ATTACCGAGG CGGCGCAGAC 

ATCTGCGACT CGAGAGTTCT GGAACGTTAC CTGC.TGGAAG CTAAAGAAGC 
TAGACGCTGA GCTCTCAAGA CCTTGCAATG GACGACCTTC GATTTCTTCG 



TGAAAACATC ACCACTGGTT GTGCTGAACA CTGTTCTTTG AACGAAAACA 
ACTTTTGTAG TGGTGACCAA CACGACTTGT GACAAGAAAC TTGCTTTTGT 



TTACGGTACC AGACACCAAG GTTAACTTCT ACGCTTGGAA ACGTATGGAA 

AATGCCATGG TCTGTGGTTC CAATTGAAGA TGCGAACCTT TGCATACCTT 

GTTGGTCAAC AAGCAGTTGA AGTTTGGCAG GGTCTGGCAC TGCTGAGCGA 

CAACCAGTTG TTCGTCAACT TCAAACCGTC CCAGACCGTG ACGACTCGCT 



GGCTGTACTG CGTGGCCAGG CACTGCTGGT AAACTCCTCT CAGCCGTGGG 
CCGACATGAC GCACCGGTCC GTGACGACCA TTTGAGGAGA GTCGGCACCC 



AACCGCTGCA GCTGCATGTT GACAAAGCAG TATCTGGCCT GAGATCTCTG 
TTGGCGACGT CGACGTACAA CTGTTTGGTC ATAGACCGGA CTCTAGAGAC 



ACTACTCTGC TGCGTGCTCT GGGTGCACAG AAAGAGGCTA TCTCTCCGCC 
TGATGAGACG ACGCACGAGA CCCACGTGTC TTTCTCCGAT AGAGAGGCGG 



GGATGCTGCA TCTGCTGCAC CGCTGCGTAC CATCACTGCT GATACCTTCC 
CCTACGACGT AGACGACGTG GCGACGCATG GTAGTGACGA CTATGGAAGG 



GCAAACTGTT TCGTGTATAC TCTAACTTCC TGCGTGGTAA ACTGAAACTG 
CGTTTGACAA AGCACATATG AGATTGAAGG ACGCACCATT TGACTTTGAC 

Sail 

TALACTGGCG AAGCATGCCG TACTGGTGAC CGCTAATAG 

ATATGACCGC TTCGTACGGC ATGACCACTG GCGATTATCA GCT 



More particularly, Table VIII illustrates oligonucleotides employed to generate the Section 1 of the ECEPO gene 
encoding amino terminal residues of the human species polypeptide. Oligonucleotides were assembled into duplexes 
( 1 and 2, 3 and 4, etc.) and the duplexes were then ligated to provide ECEPO Section 1 as in Table IX. Note that the 
assembled section includes respective terminal EcoRI and BarriH] sticky ends, that "downstream" of the EcoRI sticky 
end is a Xba\ restriction enzyme recognition site; and that "upstream" of the BamHl sticky end is a Kprii recognition 
site. Section 1 could readily be amplified using the M13 phage vector employed for verification of sequence of the 
section. Some difficulties were encountered in isolating the section as an XbaVKprti fragment from RF DNA generated 
in E.coli, likely due to methylation of the Kpri recognition site bases within the host. Single-stranded phage DNA was 
therefore isolated and rendered into double-stranded form in vitro by primer extension and the desired double-stranded 
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fragment was thereafter readily isolated. 

ECEPO gene Sections 2 and 3 (Tables XI and Xlll).were constructed in a similar manner from the oligonucleotides 
of Tables X and XII, respectively. Each section was amplified in the M1 3 vector employed for sequence verification 
and was isolated from phage DNA. As is apparent from Table XI, ECEPO Section 2 was constructed with EcoRI and 

5 BaniHl sticky ends and could be isolated as a Kpn\/Bgl\\ fragment. Similarly, ECEPO Section 3 was prepared with 
BamH\ and Sail sticky ends and could be isolated from phage RF DNA as a Bgl\\/Sah fragment. The three sections 
thus prepared can readily be assembled into a continuous DNA sequence (Table XIV) encoding the entire human 
species EPO polypeptide with an amino terminal methionine codon (ATG) for Ecoli translation initiation. Note also that 
"upstream" of the initial ATG is a series of base pairs substantially duplicating the ribosome binding site sequence of 

10 the highly expressed OMP — f gene of E.coli 

Any suitable expression vector may be employed to carry the ECEPO. The particular vector chosen for expression 
of the ECEPO gene as the "temperature sensitive" plasmid pCFM536 — a derivative of plasmid pCFM414 (A.T.C.C. 
40076) — as described in co-pending U.S. Patent Application Serial No. 636,727, filed August 6, igB4, by Charles F. 
Morris. More specifically, pCFM536 was digested with Xba\ and HincAW; the large fragment was isolated and employed 

is in a two-part ligation with the ECEPO gene. Sections 1 (Xba\/Kpn\), 2 (Kpn\/Bgl\\) and 3 (BghUSah) had previously 
been assembled in the correct order in M1 3 and the EPO gene was isolated therefrom as a single Xba\/HincA\\ fragment. 
This fragment included a portion of the polylinker from M13 mp9 phage spanning the SaL to HincAW sites therein. 
Control of expression in the resulting expression plasmid, p536, was by means of a lambda P L promoter, which itself 
may be under control of the C 1857 repressor gene (such as provided in Ecoli strain K12AHtrp). 

20 The manufactured ECEPO gene above may be variously modified to encode erythropoietin analogs such as [Asn 2 , 

des-Pro 2 through lle 6 ]hEPO and [His 7 ]hEPO, as described below. 

A. [Asn 2 des-Pro 2 through lle 6 ]hEPO 

25 Plasmid 536 carrying the ECEPO manufactured gene of Table XIV as a Xba\ to HindlU insert was digested with 

HincAW and X/toL The latter endonuclease cuts the ECEPO gene at a unique, 6 base pair recognition site spanning 
the last base of the codon encoding Asp 8 through the second base of the Arg 10 codon. A Xba\/Xho\ "linker" sequence 
was manufactured having the following sequence: 

8 9 
Asp Xhol , 
GAC-3* 
CTG AGCT-5 

The XbaUXhol linker and the Xhdl HincAW ECEPO gene sequence fragment were inserted into the large fragment 
resulting from Xba\ and HincAW digestion of plasmid pCFM526 — a derivative of plasmid pCFM414 (A.T.C.C. 40076) 
— as described in co-pending U.S. Patent Application Serial No. 636,727, filed August 6, 1984, by Charles F. Morris, 
to generate a plasmid-bome DNA sequence encoding E.coli expression of the Mef 1 form of the desired analog. 

40 

B. [His?]hEPO 

Plasmid 536 was digested with HincAW and Xho\ as in part A above. A Xba\/Xho\ linker was manufactured having 
the following sequence: 

45 m ) 

Xbal +12 3 A 5 6 7 8 9 Xho l 

Met Ala Pro Pro Arg Leu He His Asp 

5 -CTAG ATG GCT CCG CCA CGT CTG ATC CAT GAC-3'* 

so 3 -TAC CGA GGC GGT GCA GAC TAG GTA CTG AGCT-5' 

The linker and the XhoUHincAU ECEPO sequence fragment were then inserted into pCFM526 to generate a plas- 
mid-bome DNA sequence encoding E.coli expression of the Met' 1 form of the desired analog. 

Construction of a manufactured gene ("SCEPO") incorporating yeast preference codons is as described in the 
55 following Tables XV through XXI. As was the case with the ECEPO gene, the entire construction involved formation 
of three sets of oligonucleotides (Tables XV, XVII and XIX) which were formed into duplexes and assembled into sec- 
tions (Tables XVI, XVIII and XX). Note that synthesis was facilitated in part by use of some sub-optimal codons in both 
the SCEPO and ECEPO constructions, i.e., oligonucleotides 7 — 12 of Section 1 of both genes were identical, as were 
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Xbal 





♦1 


2 


. 7 


Met 


Ala 


Asn 


Cys 


ATG 


GCT 


AAT 


TGC 


TAC 


CGA 


TTA 


ACG 
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oligonucleotides 1 — 6 of Section 2 in each gene. 





TABLE XV 


SCEPO 


SECTION 1 0LIG0NUCL FflT TflF^ 


1. 


AATTCAAGCTTGGATAAAAGAGCT . 


2. 


GTGGAGCTCTTTTATCCAAGCTTG 


3. 


CCACCAAGATTGATCTGTGACTC 


4. 


TCTCGAGTCACAGATCAATCTTG 


5. 


GAGAGTTTTGGAAAGATACTTGTTG 


6. 


CTTCCAACAAGTATCTTTCCAAAAC 


7. 


GAAGCTAAAGAAGCTGAAAACATC 


a. 


GTGGTGATGTTTTCAGCTTCTTTAG 


9. 


ACCACTGGTTGTGCTGAACACTGTTC 


10. 


CAAAGAACAGTGTTCAGCACAACCA 


11. 


TTTGAACGAAAACATTACGGTACCG 


12. 


GATCCGGTACCGTAATGTTTTCGTT 



TABLE XVI 
SCEPO SECTION 1 



EcoRI riindlli l 
AATTCA AGCTTGGATA 
G T TGGAACCTAT 
2 



L 1 3 

AAAGAGCTfcc_ACCAACATTG ATCTGTGACT cbAGAGTTTT 
TTTCTCGAGG TGpTTCTAAC TAGACACTGA GCTCT^AAAA 

1 L 2 

CG«AAGATAC TTGTTGfeAAG CTAAAGAAGC TGAAAACATC hCCACTGGTT 

CCTTTCTATG AACAACCTTcj GATTTCTTCG ACTTTTGTAG TEgTgTACCAA 

i. 8 ' 

- i 11 Kpnl BamHI 

GTGCTGAACA CTGTTCjrTJG AACCAAAACA TTACGGTACC G 

CACGACTTGT GACAAGAAAC] TTGCTTTTGT AATGCCATGG CCTAG 

■ 12. 
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TABLE XVII 


SCEPO 


SECTION 2 OLIGONUCLEOTIDES 


1 


«M l 1 CGGTACCAGACACCAAGGT 


^ • 


Ui i AACCTTGGTGTCTGGTACCG 


j • 


i mac r i c » ACGCTTGGAAACGTAT . 


A 

** • 


1 ICCATACGTTTCCAAGCGTAGAA 


C 

• 


GGAAGTTGGiCAACAAGCAGTTGAAGT 


o • 


CCAAACTTCAACTGCTTGTTGACCAAC 


7 


TTGGCAAGGTTTGGCCTTGTTATCTG 


a 

0 . 


GCTTCAGATAACAAGGCCAAACCTTG 


Q 

7 • 


AAGCTGTTTTGAGAGGTCAAGCCT 


xu - 


AACAAGGCTTGACCTCTCAAAACA 


11 . 


TGTTGGTTAACTCTTCTCAACCATGGG 


12. 


TGGTTCCCATGGTTGAGAAGAGTTAACC 


13 . 


AACCATTGCAATTGCACGTCGAT 


14. 


CTTTATCGACGTGCAATTGCAA 


15. 


AAAGCCGTCTCTGGTTTGAGATCTG 


16. 


GATCCAGATCTCAAACCAGAGACGG 
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TABLE XVIII 
SCEPO SECTION 2 



Kpn l 
EcoRI 1 

A ATTCGGTACC AGACACCAAG 
GCCATGG TCTGTGGTTC 
2 



U 1 l 5 

GT pACT TCT ACGCTTGGAA ACGTATDGAA GTTGGTCAAC AAGCTGTTGA 
CAATTGfAGA TGCGAACCTT TGCATACCTIJ CAACCAGTTG TTCGACAACT 
- 6 
7 9 

agtJttggcaa ggtttggcct tcttatctg E* agct gttttg agaggtcaag 

TCAAACCpTT CCAAACCGGA ACAATAGACT TCGlftCAAAAC TCTCCAGTTC 

I IS 

. II 13 

CCTfrGTTGGT TAACTCTTCT CAACCATGGG MCCAT TGCA~ATTGCACGTC 
GGAACAACCA ATTGAGAAGA GTTGGTACCC TTSgIIAACGT TAACGTGCAG 

11 " 14 

15 BolII BamHI 

gat raagc cg tctctggttt gagatctg — 
ctaTTTc^gc agagaccaAa ctctagaccta g 

16 
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TABLE XIX 


cr cpn 

JlCr u 


atu I XUN 3 OLIGONUCLEOTIDES 


1. 


GATCCAGATCTTTGACTACTTTGTT 


2. 


TCTCAACAAAGTAGTCAAAGATCTG 


3. 


GAGAGCTTTGGGTGCTCAAAAGGAAG 


4. 


ATGGCTTCCTTTTGAGCACCCAAAGC 


5. 


CCATTTCCCCACCAGACGCTGCTT 


6. 


GCAGAAGCAGCGTCTGGTGGGGAA 


7. 


CTGCCGCTCCATTGAGAACCATC 


8. 


CAGTGATGGTTCTCAATGGAGCG 


9. 


ACTGCTGATACCTTCAGAAAGTT 


10. 


GAATAACTTTCTGAAGGTATCAG 


11. 


ATTCAGAGTTTACTCCAACTTCT 


12. 


CTCAAGAAGTTGGAGTAAACTCT 


13. 


TGAGAGGTAAATTGAAGTTGTACAC 


14. 


ACCGGTGTACAACTTCAATTTACCT 


15. 


CGGTGAAGCCTGTAGAACTGGT 


16. 


CTGTCACCAGTTCTACAGGCTTC 


17. 


GACAGATAAGCCCGACTGATAA 


18. 


GTTGTTATCAGTCGGGCTTAT 


19. 


CAACAGTGTAGATGTAACAAAG 


20. 


TCGACTTTGTTACATCTACACT 
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TABLE XX 
SCEPO SECTION 3 

BamHI Balll 1 

GATC CAGATCTTTG ACTACTTTGT ThAGAGCTTT 
GTC TAGAAAC TGATGAAACA ACTTftGAAA 
2 r 

I 5 

■SSISSKJf ^" ^ C i TT TCCCCACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGGTa^AGGGGTCG TCTGCGACGA AGACGGCCAG 

- £ 

1 9 11 

rrllrrrrrl SiI£BKE T GATACCTTCA GAAAGTlhTT CAGAGTTTAC 
GTAACTCTTG GTAGTEac]3A CTATGGAAGT CTTTCAATTSiT&CTCAAATG- 

- iS. il 

12 15 
TCCAACTTCT [TGAC AGGTAA ATTGAAGTTG TACACfcGGTG AAGCCTGTAG 
AGGTTGAAGA ATTc]TCCATT TAACTTCAAC ATGT(SBfc TTCGCACATC 



12 19 



AACTGGT hAC AGA TAAGCCC GACTGATAAfc AACAGTGTAG 
TTGACCACTG TCITATTCGGG CTGACTATTGTTQrCACATC 

18 n 



Sail 

ATGTAACAAA G 
TACATTGTTT CAGCT 
20 
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TABLE XXI 
SCEPQ GENE 

-1 +1 
Hind lll ArgAla 

AGCTTGGATA AAAGAGCTCC ACCAAGATTG ATCTGTGACT CGAGAGTTTT 
ACCTAT TTTCTCGAGG TGGTTCTAAC TAGACACTGA GCTCTCAAAA 

SJJJJJIiS II^II GGAAG CTAflft GAAGC TGAAAACATC ACCACTGGTT 
CCTTTCTATG AACAACCTTC GATTTCTTCG ACTTTTGTAG TGGTGACCAA 

GTGCTGAACA CTGTTCTTTG AACGAAAACA TTACGGTACC AGACACCAAG 
CACGACTTGT GACAAGAAAC TTGCTTTTGT AATGCCATGG TCTGTGGT?c 

GTTAACTTCT ACGCTTGGAA ACGTATGGAA GTTCGTCAAC AAGCTGTTGA 

caattgaaga tgcgaacctt tgcatacctt caaccagttg tt?gacaact 

AGTTTGGCAA GGTTTGGCCT TGTTATCTGA AGCTGTTTTG AGAGGTCAAG 
TCAAACCGTT CCAAACCCCA ACAATAGACT TCGACAAAAC TCTCCAGTTC 

SIISIISI IJiSISIISr CAACCATG GC AACCATTGCA ATTGCACGTC 
GGAACAACCA ATTGAGAAGA GTTGGTACCC TTGGTAACGT TAACGTGCAG 

GATAAAGCCG TCTCTGGTTT GAGATCTTTG ACTACTTTGT TGAGAGCTTT 
CTATTTCGGC AGAGACCAAA CTCTAGAAAC TGATGAAACA ACTCTCGAAA 

GGGTGCTCAA AAGGAAGCCA TTTCCCCACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGGT AAAGGGGTGG TCTGCGACGA AGACGGCGAG 

CATTGAGAAC CATCACTGCT GATACCTTCA GAAAGTTATT CAGAGTTTAC 
GTAACTCTTG GTAGTGACGA CTATGGAAGT CTTTCAATAA GTCTCAAATG 

Irrirrllrl InrnrnZl^ ^TTGAAGTTG TACACCGGTG AAGCCTGTAG 
AGGTTGAAGA ACTCTCCATT TAACTTCAAC ATGTGGCCAC TTCGGACATC 

AACTGGTGAC AGATAAGCCC GACTGATAAC AACAGTGTAG 
TTGACCACTG TCTATTCGGG CTGACTATTG TTGTCACATC 

Sail 

ATGTAACAAA G 
TACATTGTTT CAGCT 



The assembled SCEPO sections were sequenced in M1 3 and Sections 1 , 2 and 3 were isolatable from the phage 
as Hind\WKprt/BGI\, and BgM/Sal> fragments. 

The presently preferred expression system for SCEPO gene products is a secretion system based on S.cerevisiae 
a-factor secretion, as described in co-pending U.S. Patent Application Serial No. 487,753, filed April 22, 1 983, by Grant 
A. Bitter, published October 31, 1984 as European Patent Application 0 123 294. Briefly put, the system involves 



44 



EP 0 148 605 B2 



constructions wherein DNA encoding the leader sequence of the yeast a-factor gene product is positioned immediately 
5' to the coding region of the exogenous gene to be expressed. As a result, the gene product translated includes a 
leader or signal sequence which is "processed off 1 by an endogenous yeast enzyme in the course of secretion of the 
remainder of the product. Because the construction makes use of the a-factor translation initiation (ATG) i codon, there 

s was no need to provide such a codon at the -1 position of the SCEPO gene. As may be noted from Table XXI, the 
alanine (+1 ) encoding sequence is preceded by a linker sequence allowing for direct insertion into a plasmid including 
the DNA for the first 80 residues of the a-factor leader following the a-factor promoter. The specific preferred construc- 
tion for SCEPO gene expression involved a four-part ligation including the above-noted SCEPO section fragments and 
the large fragment of HindWU Sal\ digestion of plasmid paC3. From the resulting plasmid paC3/SCEPO, the a-factor 

io promoter and leader sequence and SCEPO gene were isolated by digestion with BamHI and ligated into BamHI di- 
gested plasmid pYE to form expression plasmid pYE/SCEPO. 

Example 12 



The present example relates to expression of recombinant products of the manufactured ECEPO and SCEPO 
genes within the expression systems of Example 11 . 

In use of the expression system designed for use of E.coli host cells, plasmid p536 of Example 1 1 was transformed 
into AM7 E.coli ce\\s previously transformed with a suitable plasmid, pMW1, harboring a C 1857 gene. Cultures of cells 
in LB broth (Ampicillin 50 u.g/ml and kanamycin 5 fig/ml, preferably with 10 mM MgS0 4 ) were maintained at 28°C and 
upon growth of cells in culture to O.D. 600 = 0.1, EPO expression was induced by raising the culture temperature to 
42°C. Cells grown to about 40 O.D. provided EPO production (as estimated by gel) of about 5 mg/OD liter. 

Cells were harvested, lysed, broken with French Press (10,000 psi) and treated with lysozyme and NP — 40 de- 
tergent. The pellet resulting from 24,000 xg centrifugation was solubilized with guanidine HCI and subjected to further 
purification in a single step by means of C 4 (Vydac) Reverse Phase HPLC (EtOH, 0-^80%, 50 mM NH 4 Ac, pH 4.5). 
Protein sequencing revealed the product to be greater than 95% pure and the products obtained revealed two different 
amino terminals A — P — P — R...and P— P— R...in a relative quantitative ratio of about 3 to 1 . This latter observation 
of hEPO and [des Ala 1 ]hEPO products indicates that amino terminal "processing" within the host cells serves to remove 
the terminal methionine and in some instances the initial alanine. Radioimmunoassay activity for the isolates was at a 
level of 1 50,000 to 160,000 U/mg; in vitro assay activity was at a level of 30,000 to 62,000 U/mg; and in vivo assay 
activity ranged from about 120 to 720 U/mg: (Cf., human urinary isolate standard of 70,000 U/mg in each assay.) The 
dose response curve for the recombinant product in the in viva assay differed markedly from that of the human urinary 
EPO standard. 

The EPO analog plasmids formed in parts A and B of Example 11 were each transformed into pMW1 -transformed 
AM7 E.coli cells and the cells were cultured as above. Purified isolates were tested in both RIA and in vitro assays. 
RIA and in vitro assay values for [Asn 2 , des-Pro 2 through lle 6 ]hEPO expression products were approximately 11,000 
U/mg and 6,000 U/mg protein, respectively, while the assay values for [His 7 ]hEPO were about 41 ,000 U/mg and 1 4,000 
U/mg protein, respectively, indicating that the analog products were from one-fourth to one-tenth as "active" as the 
"parent" expression product in the assays. 

In the expression system designed for use of S.cerevisiae host cells, plasmid pYE/SCEPO was transformed into 
two different strains, YSDP4 (genotype a pep4—3 trp1) and RK81 (genotype aa pep4—3 trp1). Transformed YSDP4 
hosts were grown in SD medium (Methods in Yeast Genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, N. 
Y, p. 62 (1 983) supplemented with casamino acids at 0.5%, pH 6.5 at 30°C. Media harvested when the cells had been 
grown to 36 O.D. contained EPO products at levels of about 244 Ug/ml (97 ng/OD liter by RIA). Transformed RK81 
cells grown to either 6.5 O.D. or 60 O.D. provided media with EPO concentrations of about 80—90 U/ml (34 u.g/OD 
liter by RIA). Preliminary analyses reveal significant heterogeneity in products produced by the expression system, 
likely to be due to variations in glycosylate of proteins expressed, and relatively high mannose content of the asso- 
ciated carbohydrate. 

Plasmids PaC3 and pYE in HB101 E.coli cells were deposited in accordance with the Rules of Practice of the U. 
S. Patent Office on September 27, 1984, with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, 
Maryland, under deposit numbers AT.C.C. 39881 and ATC.C. 39882, respectively. Plasmids pCFM526 in AM7 cells, 
pCFM536 in JM1 03 cells, and pMWl in JM103 cells were likewise deposited on November 21,1 984 as A.T.C.C. 39932, 
39934, and 39933, respectively. Saccharomyces cerevisiae strains YSPD4 and RK81 were deposited on November 
21, 1984 as A.T.C.C. 20734 and 20733, respectively. 

It should be readily apparent from consideration of the above illustrative examples that numerous exceptionally 
valuable products and processes are provided by the present invention in its many aspects. 

Polypeptides provided by the invention are conspicuously useful materials, whether they are microbiafly expressed 
products or synthetic products, the primary, secondary or tertiary structural conformation of which was first made known 
by the present invention. 
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As previously indicated, recombinant-produced and synthetic products of the invention share, to varying degrees, 
the in vitro biological activity of EPO isolates from natural sources and consequently are projected to have utility as 
substitutes for EPO isolates in culture media employed for growth of erythropoietic cells in culture. Similarly, to the 
extent that polypeptide products of the invention share the in vivo activity of natu ral EPO isolates they are conspicuously 

s . suitable for use in erythropoietin therapy procedures practiced on mammals, including humans, to develop any or all 
of the effects heretofore attributed in vivo\o EPO, e.g., stimulation of reticulocyte response, development of ferrokinetic 
effects (such as plasma from turnover effects and marrow transit time effects), erythrocyte mass changes, stimulation 
of hemoglobin C synthesis (see, Eschbach, at al., supra) and. as indicated in Example 10, increasing hematocrit levels 
in mammals. Included within the class of humans treatable with products of the invention are patients generally requiring 

io blood transfusions and including trauma victims, surgical patients, renal disease patients including dialysis patients, 
and patients with a variety of blood composition affecting disorders, such as hemophilia, sickle cell disease, physiologic 
anemias, and the like. The minimization of the need for transfusion therapy through use of EPO therapy can be expected 
to result in reduced transmission of infectious agents. Products of the invention, by virtue of their production by recom- 
binant methods: are expected to be free of pyrogens, natural inhibitory substances, and the like, and are thus likely to 

15 provide enhanced overall effectiveness in therapeutic processes vis-a-vis naturally derived products. Erythropoietin 
therapy with products of the present invention is also expected to be useful in the enhancement of oxygen carrying 
capacity of individuals encountering hypoxic environmental conditions and possibly in providing beneficial cardiovas- 
cular effects. 

A preferred method for administration of polypeptide products of the invention is by parenteral (e.g.. IV, IM, SC, 

20 or IP) routes and the compositions administered would ordinarily include therapeutically effective amounts of product 
in combination with acceptable diluents, carriers and/or adjuvants. Preliminary pharmacokinetic studies indicate a long- 
er half-life in vivo for monkey EPO products when administered IM rather than IV Effective dosages are expected to 
vary substantially depending upon the condition treated but therapeutic doses are presently expected to be in the range 
of 0.1 (~7U) to 100 (-7000U) ng/kg body weight of the active material. Standard diluents such as human serum 

2S albumin are contemplated for pharmaceutical compositions of the invention, as are standard carriers such as saline. 

Adjuvant materials suitable for use in compositions of the invention include compounds independently noted for 
erythropoietic stimulatory effects, such as testosterones, progenitor cell stimulators, insulin-like growth factor, prostag- 
landins, serotonin, cyclic AMP, prolactin and triiodothyronine, as well as agents generally employed in treatment of 
aplastic anemia, such as methenolene, stanozolol and nandrolone [see, e.g., Resegotti, at al., Panminerva Medica, 

30 23, 243—248 (1 981 ); McGonigle, at al. , Kidney Int., 25(2), 437—444 (1 984); Pavlovic-Kantera, at al., Expt. Hematol'., 
8(Supp. 8), 283—291 (1980); and Kurtz, FEBS Letters, 14a(1), 105—108 (1982)1. Also contemplated as adjuvants 
are substances reported to enhance the effects of. or synergize, erythropoietin or asialo-EPO, such as the adrenergic 
agonists, thyroid hormones, androgens and BPA [see, Dunn, "Current Concepts in Erythropoiesis". John Wiley and 
Sons (Chichester, England, 1983); Weiland, atal., Btut. 44(3), 173—175 (1982); Kalmanti, Kidney Int., 22, 383—391 

35 (1982); Shahidi, New Eng. J. Med., 289, 72—80 (1973); Fisher, at al.„ Steroids, 30(6), 833—845 (1977); Urabe, at 
al., J. Exp. Med., 149, 1314 — 1325 (1979); and Billat, et al., Expt. Hematol, 10(1), 133—140 (1982)1 as well as the 
classes of compounds designated "hepatic erythropoietic factors" [see, Naughton, at al., Acta. Haemal, 69, 171 — 179 
(1983)] and "eryihrotropins" [as described by Congote, at al. in Abstract 364, Proceedings 7th International Congress 
of Endocrinology (Quebec City, Quebec, July 1-7, 1984); Congote, Biochem. Biophys. Res. Comm., 115(2), 447 — 483 

^0 (1 983) and Congote, Anal. Biochem., 140, 428^33 (1 984)1 and "erythrogenins" [as described in Rothman, at al., J. 
Surg. Oncol., 20, 105—108 (1982)]. Preliminary screenings designed to measure erythropoietic responses of ex- 
hypoxic polycythemic mice pre-treated with either 5-cc-dihydrotestosterone or nandrolone and then given erythropoietin 
of the present invention have generated equivocal results. 

Diagnostic uses of polypeptides of the invention are similarly extensive and include use in labelled and unlabelled 

45 forms in a variety of immunoassay techniques including RIA's, ELISA's and the like, as well as a variety of vitro and in 
viva activity assays. See, e.g.. Dunn, at al., Expt. Hematol., 11(7), 590—600 (1983); Gibson, at al., Pathology, 16, 
155—156 (1984); Krystal, Expt Hematol., 11(7), 649— 660 (1983); Saito, et al., Jap. J. Med., 23(1), 15—21 (1984); 
Nathan at al., New Eng. J. Med., 308(9), 520 — 522 (1983); and various references pertaining to assays referred to 
therein. Polypeptides of the invention, including synthetic peptides comprising sequences of residues of EPO first 

50 revealed herein, also provide highly useful pure materials for generating polyclonal antibodies and "banks" of mono* 
clonal antibodies specific for differing continuous and discontinuous epitopes of EPO. As one example, preliminary 
analysis of the amino acid sequences of Table VI in the context of hydropathicity according to Hopp, at al., P.N.A.S. 
(U.S.A.), 78, pp. 3824-3828 (1981) and of secondary structures according to Chou, at al., Ann. Rev. Biochem., 47, p. 
251 (1 978) revealed that synthetic peptides duplicative of continuous sequences of residues spanning positions 41 -57 

55 inclusive, 116-118 inclusive and 1 44-1 66 inclusive are likely to produce a highly antigenic response and generate useful 
monoclonal and polyclonal antibodies immunoreactive with both the synthetic peptide and the entire protein. Such 
antibodies are expected to be useful in the detection and affinity purification of EPO and EPO-related products but are 
not claimed herein. 
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Illustratively, the following three synthetic peptides were prepared: 

(1) hEPO 41-57, V-P-D-T-K-V-N-F-Y-A-W-K-R-M-E-V-G; 

(2) hEPO 116-128, K-E-A-l-S-P-P-D-A-A-S-A-A; 

(3) hEPO 144-166, V-Y-S-N-F-L-R-G-K-L-K-L-Y-T-G-E-A-C-R-T-G-O-R, 

Preliminary immunization studies employing the above-noted polypeptides have revealed a relatively weak positive 
response to hEPO 41—57, no appreciable response to hEPO 116—128, and a strong positive response to hEPO 
144—146, as measured by capacity of rabbit serum antibodies to immunoprecipitate 125 l-labelled human urinary EPO 
isolates. Preliminary in vivo activity studies on the three peptides revealed no significant activity either alone or in 
combination. 

While the reduced sequences of amino acid residues of mammalian EPO provided by the illustrative examples 
essentially define tne primary structural conformation of mature EPO, it will be understood that the specific sequence 
of 165 amino acid residues of monkey species EPO in Table V and the 166 residues of human species EPO in Table 
VI do not limit the scope of useful polypeptides provided by the invention. Comprehended by the present invention are 
those various naturally-occurring allelic forms of EPO which past research into biologically active mammalian polypep- 
tides such as human y interferon indicates are likely to exist. Compare, e.g., the human immune interferon species 
reported to have an arginine residue at position No. 140 in EPO published application 0 077 670 and the species 
reported to have glutamine at position No. 140 in Gray, et al., Nature, 295, pp. 503—508 (1982). Both species are 
characterized as constituting "mature" human y interferon sequences.) Allelic forms of mature EPO polypeptides may 
vary from each other and from the sequences of Tables V and VI in terms of length of sequence and/or in terms of 
deletions, substitutions, insertions or additions of amino acids in the sequence, with consequent potential variations 
in the capacity for glycosylation. As noted previously, one putative allelic form of human species EPO is believed to 
include a methionine residue at position 126. Expectedly, naturally-occurring allelic forms of EPO-encoding DNA and 
cDNA secuences are also likely to occur which code for the above-noted types of allelic polypeptides or simply employ 
differing codons for designation of the same polypeptides as specified. 

In addition to naturally-occurring allelic forms of mature EPO, the present invention also embraces other "EPO 
procucts" such as polypeptide analogs of EPO and fragments of "mature" EPO. Following the procedures of the above- 
noted published application by Alton, at al. (WO/83/04053) one may readily design and manufacture genes coding for 
microbial expression of polypeptides having primary conformations which differ from that herein specified for mature 
EPO in terms of the identity or location of one or more residues (e.g.. substitutions, terminal and intermediate additions 
and deletions). Alternately, modifications of monkey cDNA and genomic EPO genes may be readily accomplished by 
well-known site-directed mutagenesis techniques and employed to generate analogs and derivatives of EPO. Such 
EPO. products would share at least one of the biological properties of EPO but may differ in others. As examples, 
projected EPO products of the invention include those which are foreshortened by e.g., deletions [Asn*, des-Pro* 
through lle*]hEPO, [des-Thr"* through Arg^]hEPO and "A27— 55hEPO", the latter having the residues coded for 
by an entire exon deleted; or which are more stable to hydrblysis.(and, therefore, may have more pronounced or longer 
lasting effects than naturally-occurring EPO); or which have been altered to delete one or more a potential sites for 
glycosylation (which may result in higher activities for yeast-produced products); or which have one or more cystein 
residues deleted or replaced by, e.g., histidine or serine residues (such as the analog [His 7 ]hEPO) and are potentially 
more easily isolated in active form from microbial systems; or which have one or more tyrosine residues replaced by 
phenylalanine (such as the analogs [Phe' 5]hEPO, [Phe^JhEPO. and [Phe^jnEPO) and may bind more or less readily 
to EPO receptors on target cells. Also comprehended are polypeptide fragments duplicating only a part of the contin- 
uous amino acid sequence or secondary conformations within mature EPO, which fragments may possess one activity 
of EPO (e.g., receptor binding) and not others (e.g., erythropoietic activity). Especially significant in this regard are 
those potential fragments of EPO which are elucidated upon consideration of the human genomic DNA sequence of 
Table VI, i.e., "fragments" of the total continuous EPO sequence which are delineated by intron sequences and which 
may constitute distinct "domains" of biological activity It is noteworthy that the absence of in vivo activity for any one 
or more of the "EPO products" of the invention is not wholly preclusive of therapeutic utility (see. Weiland, at al., supra) 
or of utility in other contexts, such as in EPO assays or EPO antagonism. Antagonists of erythropoietin may be quite 
useful in treatment of polycythemias or cases of overproduction of EPO [see, e.g., Adamson, Hasp. Pracrice, 18(12) 
49—57 (1983), and Hellman, at al., Clin. Lab. Haemal, 5, 335—342 (1983)]. 

According to another aspect of the present invention, the cloned DNA sequences described herein which encode 
human and monkey EPO polypeptides are conspicuously valuable for the information which they provide concerning 
the amino acid sequence of mammalian erythropoietin which has heretofore been unavailable despite decades of 
analytical processing of isolates of naturally-occurring products. The DNA sequences are also conspicuously valuable 
as products useful in effecting the large scale microbial synthesis of erythropoietin by a variety of recombinant tech- 
niques. Put another way, DNA sequences provided by the invention are useful in generating new and useful viral and 
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circular plasmid DNA vectors, new and useful transformed and transfected microbial procarybtic and eucaryotic host 
cells (including bacterial and yeast cells and mammalian cells grown, in culture), and new and useful methods for 
cultured growth of such microbial host cells capable of expression of EPO and EPO products. DNA sequences of the 
invention are also conspicuously suitable materials for use as labelled probes in isolating EPO and related protein 

5 encoding cDNA and genomic DNA sequences of mammalian species other than human and monkey species herein 
specifically illustrated. The extent to which DNA sequences of the invention will have use in various alternative methods 
of protein synthesis (e.g., in insect cells) or in genetic therapy in humans and other mammals cannot yet be calculated. 
DNA sequences of the invention are expected to be useful in developing transgenic mammalian species which may 
serve as eucaryotic 'hosts" for production of erythropoietin and erythropoietin products in quantity. See, generally, 

10 Palmiter, etal., Sc/ence, 222(4625), 809— 814(1983). 

Viewed in this light, therefore, the specific disclosures of the illustrative examples are clearly not intended to be 
limiting upon the scope of the present invention and numerous modifications and variations are expected to occur to 
those skilled in the art. As one example, while DNA sequences provided by the illustrative examples include monkey 
cDNA and genomic DNA sequences, because this application provides amino acid sequence information essential to 

is manufacture of DNA sequence, the invention also comprehends such manufactured DNA sequences as may be con- 
structed based on knowledge of EPO amino acid sequences. These may code for EPO (as in Example 12) as well as 
for EPO fragments and EPO polypeptide analogs (i.e., "EPO Products") which may share one or more biological prop- 
erties of naturally-occurring EPO but not share others (or possess others to different degrees). 

DNA sequences provided by the present invention are thus seen to comprehend all DNA sequences suitable for 

20 use in securing expression in a procaryotic or eucaryotic host cell of a polypeptide product having at least a part of the 
primary structural conformation and one or more of the biological properties of eryth ropoietin, and selected from among: 
(a) the DNA sequences set out in Tables V and VI; (b) DNA sequences which hybridize to the DNA secuences defined 
in (a) or fragments thereof; and (c) DNA sequences which, but for the degeneracy of the genetic code, would hybridize 
to the DNA sequences defined in (a) and (b). It is noteworthy in this regard, for example, that existing allelic monkey 

25 and human EPO gene sequences and other mammalian species gene sequences are expected to hybridize to the 
sequences of Tables V and VI or to fragments thereof. Further, but for the degeneracy of the genetic code, the SCEPO 
and EC EPO genes and the manufactured or mutagenized cDNA or genomic DNA sequences encoding various EPO 
fragments and analogs would also hybridize to the above-mentioned DNA sequences. Such hybridizations could readily 
be carried out under the hybridization conditions described herein with respect to the initial isolation of the monkey 

30 and human EPO-encoding DNA or more stringent conditions, if desired to reduce background hybridization. 

In a like manner, while the above examples illustrate the invention of microbial expression of EPO products in the 
context of mammalian cell expression of DNA inserted in a hybrid vector of bacterial plasmid and viral genomic origins, 
a wide variety of expression systems are within the contemplation of the invention. Conspicuously comprehended are 
expression systems involving vectors of homogeneous origins applied to a variety of bacterial, yeast and mammalian 

35 cells in culture as well as to expression systems not involving vectors (such as calcium phosphate transf ection of cells). 
In this regard, it will be understood that expression of, e.g., monkey origin DNA in monkey host cells in culture and 
human host cells in culture, actually constitute instances of "exogenous" DNA expression inasmuch as the EPO DNA 
whose high level expression is sought would not have its origins in the genome of the host. Expression systems of the 
invention further contemplate these practices resulting in cytoplasmic formation of EPO products and accumulation of 

40 glycosylated and non-glycosylated EPO products in host cell cytoplasm or membranes (e.g., accumulation in bacterial 
periplasmic spaces) or in culture medium supernatants as above illustrated, or in rather uncommon systems such as 
R aeruginosa expression systems (described in Gray, at aL Biotechnology, 2, pp. 161—165 (1984). 

Improved hybridization methodologies of the invention, while illustratively applied above to DNA/DNA hybridization 
screenings are equally applicable to RNA/RNA and RNA/DNA screening. Mixed probe techniques as herein illustrated 

45 generally constitute a number of improvements in hybridization processes allowing for more rapid and reliable poly- 
nucleotide isolations. These many individual processing improvements include: improved colony transfer and mainte- 
nance procedures; use of nylon-based filters such as GeneScreen and GeneScreen Plus to allow reprobing with same 
filters and repeated use of the filter, application of novel protease treatments [compared, e.g., to Taub, at al.. Anal. 
Bhchem., 126, pp. 222—230 (1982)]; use of very low individual concentrations (on the order of 0.025 picomole) of a 

50 large number of mixed probes (e.g.. numbers in excess of 32); and, performing hybridization and post-hybridization 
steps under stringent temperatures closely approaching (i.e., within 4°C and preferably within 2°C away from) the 
lowest calculated dissocation temperature of any of the mixed probes employed. These improvements combine to 
provide results which could not be expected to attend their use. This is amply illustrated by the fact that mixed probe 
procedures involving 4 times the number of probes ever before reported to have been successfully used in even cDNA 

55 screens on messenger RNA species of relatively low abundancy were successfully applied to the isolation of a unique 
sequence gene in a genomic library screening of 1,500,000 phage plaques. This feat was accomplished essentially 
concurrently with the publication of the considered opinion of Anderson, at al., supra, that mixed probe screening 
methods were •...impractical for isolation of mammalian protein genes when corresponding RNA's are unavailable. 
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1. A DNA sequence for use in securing expression in a procaryotic or eucaryotic host cell of a polypeptide product 
having at least part of the primary structural confirmation [sic] of that of erythropoietin to allow possession of the 
biological property of causing bone marrow cells to increase production of reticulocytes and red blood cells and 
to increase hemoglobin [sic] synthesis or iron uptake, said DNA sequence selected from the group consisting of: 

(a) the DNA sequences set out in Tables V and VI or their complementary strands; 

(b) DNA sequences which hybridize under stringent conditions to the protein coding regions of the DNA se- 
quences defined in (a) or fragments thereof; and 

(c) DNA sequences which, but for the degeneracy of the genetic code, would hybridize to the DNA sequences 
defined in (a) and (b). 

2. A DNA sequence according to Claim 1 encoding human erythropoietin. 

3. A cDN A sequence according to Claim 1 being a monkey species erythropoietin coding DNA sequence. 

4. A DNA sequence according to Claim 3 and including the protein coding region set forth in Table V. 

5. A genomic DNA sequence according to Claim 1 or 2. 

6. A human species erythropoietin coding DNA sequence according to Claim 5. 

7. A DNA sequence according to Claim 6 and including the protein coding region set forth in Table VI. 

8. A DNA sequence according to Claim 1 or 2, covalently associated with a detectable label substance. 

9. A DNA sequence according to Claim 8, wherein the detectable label is a radiolabel. 

10. A single-strand DNA sequence according to Claim 8 or 9. 

11. A DNA sequence according to Claim 1 , coding for [Phe 15 ]hEPO, [Phe 49 ]hEPO, [Phe 145 ]hEPO, [His 7 ]hEPO, [Asn 2 
des-Pro2 through lle 6 ]hEPO, [des-Thr 16 3 through Arg 166 ]hEPO, or[A27-55]hEPO. 

12. A procaryotic or eucaryotic host cell transformed or transfected with a DNA sequence according to any one of 
Claims 1, 2, 3, 6, 7 and 8, in a manner allowing the host cell to express said polypeptide product. 

1 3. A transformed or transfected host cell according to C laim 1 2 which host cell is capable of glycosylating said polypep- 
tide. 

14. A transformed or transfected mammalian host cell according to Claim 13. 

15. A transformed or transfected COS cell according to Claim 1 3. 

16. A transformed or transfected CHO cell according to Claim 13. 

17. A biologically functional circular plasmid or viral DNA vector including a DNA sequence according to any one of 
Claims 1, 2, 3, 5, 6, 7, or 11. 

18. A procaryotic or eucaryotic host cell stably transformed or transfected with a DNA vector according to Claim 17. 

19. A recombinant polypeptide having part or all of the primary structural conformation of human or monkey erythro- 
poietin as set forth in Table VI or Table V or any allelic variant or derivative thereof possessing the biological 
property of causing bone marrow cells to increase production of reticulocytes and red blood cells to increase 
hemoglobin synthesis or iron uptake and characterized by being the product of eucaryotic expression of an exog- 
enous DNA sequence and which has higher molecular weight by SDS-PAGE from erythropoietin isolated from 
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urinary sources. 

20. A glycoprotein polypeptide according to Claim 1 9 having an average carbohydrate composition which differs from 
that of human erythropoietin isolated from urinary sources. 

5 

21. A polypeptide according to Claim 19 or 20 wherein the exogenous sequence is a cDNA sequence. 

22. A polypeptide according to Claim 1 9 or 20 wherein the exogenous DNA sequence is a genomic DN A sequence. 

10 23. A polypeptide according to Claim 1 9 or 20 wherein the exogenous DNA sequence is carried on an autonomously 
replicating circular DNA plasmid or viral vector. 

24. A polypeptide according to any one of Claims 1 9 to 23 further characterized by being covalently associated with 
a detectable label substance. 
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25. A polypeptide according to Claim 24, wherein said detectable label is a radiolabel. 

26. A polypeptide product of the expression in a eucaryotic host cell of a DNA sequence according to any of Claims 
1,2,3, 5, 6 and 7. 

27. A process for production of a polypeptide having at least part of the primary structural conformation of erythropoietin 
to allow possession of the biological property of causing bone marrow cells to increase production of reticulocytes 
and red blood cells and to increase hemoglobin synthesis or iron uptake, which process is characterized by culturing 
under suitable nutrient conditions a procaryotic or eucaryotic host cell transformed or transfected with a DNA 
sequence according to any of Claims 1 , 2, 3, 5, 6 and 7 in a manner allowing the host cell to express said polypep- 
tide; and optionally isolating the desired polypeptide product of the expression of the DNA sequence. 

28. A process according to Claim 27, characterized by culturing a host cell of any one of Claims 12 to 16. 

30 29. A process according to Claim 27 or 28 for production of a polypeptide of any one of Claims 1 9 to 23 and 26. 

30. A pharmaceutical composition comprising a polypeptide produced in accordance with the process of Claim 27 28 
or 29 and a pharmaceutically acceptable diluent, adjuvant or carrier. 

3S 31. a pharmaceutical composition according to Claim 30, comprising a polypeptide of any one of Claims 1 9 to 23 and 
26. 
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Patentanspruche 



1. 



DNA-Sequenz, die dafur verwendet wird, Expression in einer prokaryontischen oder eukaryontischen Wirtszelle 
eines Polypeptidprodukts zu erreichen, das zumindest teilweise die Primarstrukturkonformation von Erythropoietin 
aufweist, urn den Besitz der biologischen Eigenschaft zu ermoglichen, Knochenmarkszellen zu veranlassen die 
Produktion von Retikulozyten und roten Blutkorperchen zu steigern und Hamoglobinsynthese oder Eisenaufnahme 
zu steigern, wobei besagte DNA-Sequenz aus der Gruppe ausgewahlt ist, die aus 

(a) den in den Tabellen V und VI dargelegten DNA-Sequenzen oder deren Komplementarstrangen; 

(b) DNA-Sequenzen, die unter stringenten Bedingungen zu den proteincodierenden Bereichen derin (a) de- 
fimerten DNA-Sequenzen oder Fragmenten derselben hybridisieren; und 

(c) DNA-Sequenzen, die ohne die Entartung des genetischen Codes zu den in (a) und (b) definierten DNA- 
Sequenzen hybridisieren wurden; 

besteht. 

55 2. DNA-Sequenz nach Anspruch 1 , die Human-Erythropoiet in codiert. 

3. cDNA-Sequenz gemaG Anspruch 1 , die ein Erythropoietin einer Affenart codiert. 
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4. DNA-Sequenz nach Anspruch 3, die den in Tabelle V dargestellten proteincodierenden Bereich einschlieBt. 

5. Genomische DNA-Sequenz nach Anspruch 1 Oder 2. 

6. DNA-Sequenz nach Anspruch 5, die ein Erythropoietin einer menschlichen Spezies codiert. 

7. DNA-Sequenz nach Anspruch 6, die den in Tabelle VI dargestellten proteincodierenden Bereich einschlieBt. 

8. DNA-Sequenz nach Anspruch 1 oder 2, die kovalent mit einer nachweisbaren Markierungssubstanz verbunden ist. 

9. DNA-Sequenz nach Anspruch 8, dadurch gekennzeichnet, daB die nachweisbare Markierung eine radioaktive 
Markierung ist. 

10. Einzelstrangige DNA-Sequenz nach Anspruch 8 oder 9. 

11. DNA-Sequenz nach Anspruch 1, dief£ir[PheiS]hEPO, [Phe«]hEPO, [Phe^SjhEPO, [His 7 ]hEPO f Asides - Pro? 
bis Ue6]hEPO, [des - Thriw bis Argi66 ]hEPO oder [A27-55]hEPO codiert. 

12. Prokaryontische oder eukaryontische Wirtszelle, transformiert oder transtiziert mit einer DNA-Sequenz nach einem 
der Anspruche 1, 2; 3, 6, 7 und 8, in einer Weise, die es der Wirtszelle erlaubt. besagtes Polypeptidprodukt zu 
exprimieren. r 

13. Transformierte oder transfizierte Wirtszelle nach Anspruch 12. dadurch gekennzeichnet, daB die Wirtszelle in der 
Lage ist, besagtes Polypeptid zu glykosylieren. 

14. Transformierte oder transfizierte Saugetier-Wirtszelle nach Anspruch 1 3. 

15. Transformierte oder transfizierte COS-Zelle nach Anspruch 13. 

16. Transformierte oder transfizierte CHO-Zelle nach Anspruch 13. 

17. Biologisch f unktbneller Zirkular-PlasmfcJ- oder V.rus-DNA-Vektor, der eine DNA-Sequenz nach einem der Anspru- 
che 1, 2, 3, 5, 6, 7 oder 11 einschlieBt. 

18. Prokaryontische oder eukaryontische Wirtszelle, die mit einem DNA-Vektor nach Anspruch 17 stabil transfomiert 
oder transfiziert ist. 

19. Rekombinantes Polypeptid, das einen Teil oder die Gesamtheit der Primarstrukturkonformation von Human- oder 
Affen-Erythropoietin, wie dargestellt in Tabelle VI oder Tabelle V, oder irgendeine allelische Variante oder ein De- 
nvat desselben aufweist, das die biologische Eigenschaft besitzt, Knochenmarkszellen zu veranlassen die Pro- 
duktion von Ret.kulozyten oder roten Blutkorperchen zu steigem und Hamoglobinsynthese oder Eisenaufnahme 
zu steigern, dadurch gekennzeichnet, daB es das Produkt eukaryontischer Expression einer exogenen DNA-Se- 
quenz ist und welches gemaB SDS-PAGE ein hoheres Molekulargewicht hat als Erythropoietin, das aus Urin- 
Quellen isoliert ist 

20. Gtykoprotein-Polypeptid nach Anspruch 19. das eine mittlere Kohlenhydratzusammensetzung aufweist die sich 
von derjenigen von Human-Erythropoietin unterscheidet, das aus Urin-quellen isoliert ist. 

21. Polypeptid nach Anspruch 19 oder 20, dadurch gekennzeichnet, daB die exogene DNA-Sequenz eine cDNA- 
Sequenz ist. 

22. Polypeptid nach Anspruch 1 9 oder 20. dadurch gekennzeichnet, daB die exogene DNA-Sequenz eine genomische 
DNA-Sequenz ist. . 

23. Polypeptid nach Anspruch 19 oder 20, dadurch gekennzeichnet, daB die exogene DNA-Sequenz auf einem au- 
tonom rephzierenden Zirkular-DNA-Plasmid- oder Virus-Veklor sitzt. 

24. Polypeptid nach einem der Anspruche 1 9 bis 23, weiter dadurch gekennzeichnet, daB es kovalent mit einer nach- 
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weisbaren Markierungssubstanz verbunden ist. 

25. Polypeptid nach Anspruch 24, dadurch gekennzeichnet, da(3 besagte nachweisbare Markierung eine radioaktive 
Markierung ist. 

26. Polypeptidprodukt der Expression einer eukaroyntischen Wirtszelle einer DNA-Sequenz nach einem der Anspru- 
che 1 , 2, 3, 5, 6 und 7. 

27. Verfahren zur Herstellung eines Polypeptids, das zumindest teilweise die Primarstrukturkonformation von Erythro- 
poietin autweist, um den Besitz der biologischen Eigenschaft zu ermoglichen, Knochenmarkszellen zu veranlas- 
sen, die Produkt.on von Retikulozyten und roten Blutkorperchen zu steigem und Hamoglobinsyntbese Oder Ei- 
senaufnahme zu steigem, dadurch gekennzeichnet, daB eine prokaryontische oder eukaryontische Wirtszelle die 
nut einer DNA-Sequenz nach einem der Anspruche 1 , 2, 3, 5, 6 und 7 derart transformiert oder transfiziert wor'den 
ist, daB es der Wirtszelle ermoglicht ist, besagtes Polypeptid zu exprimieren, unter geeigneten Nahrbedingungen 
kultiviert w.rd; und daB fakultativ das gewunschte Polypeptidprodukt der Expression der DNA-Sequenz isoliert wird. 

28. Verfahren nach Anspruch 27, dadurch gekennzeichnet, daB eine Wirtszelle nach einem der Anspruche 12 bis 16 
kultiviert wird. 

29. Verfahren nach Anspruch 27 oder28 zur Produktion eines Polypeptids nach einem der Anspruche 1 9 bis 23 und 26. 

30. Pharmazeutische Zusammensetzung, die ein Polypeptid. das gemaB dem Verfahren nach Anspruch 27 28 oder 
29 produziert ist, und ein pharmazeutisch annehmbares Verdunnungsmittel, Adjuvans oder Tragermittel umfaBt. 

31. Pharmazeutische Zusammensetzung nach Anspruch 30, die ein Polypeptid nach einem der Anspruche 1 9 bis 23 
und 26 umfaBt. 

Revendlcatfons 

1 . Sequence d'ADN utilisable pour garantir ('expression, dans una cellule hate procaryote ou eucaryote d'un produit 
polypeptKJique ayant au moins une partie de la conformation structurelle primaire de celle de l'6rythro.H>i6tine pour 
lu. permettre de posseder la propriete biologique de provoquer. par les cellules de la moelle osseuse I'augmen- 
tat.on de la production des reticulocytes et des globules rouges, et ('augmentation de la synthese d'hemoglobine 
ou du captage de fer, ladite s6quence d'ADN 6tant choisie dans le groupe consthu6 de : 

a) les sequences d'ADN indiquees dans les tableaux V et VI ou leurs brins complementaires ■ 

b) des sequences d'ADN qui s'hybrident dans des conditions strides aux regions de codage de proteine des 
sequences d'ADN d6finies dans (a) ou des fragments de celles-ci; et 

2*™ ^ uenc f d ' ADN ( " ji ' n ' 6tart la degenerescence du code genetique, s'hybrideraient aux sequences 
d ADN definies dans (a) et (b). 

2. Sequence d'ADN seton la revendication 1 , codant l'6rythropoi6tine humaine. 

3. Sequence d'ADNc selon la revendication 1 , qui est une sequence d'ADN codant I'erythropoieline de singe. 

4. Sequence d'ADN selon la revendication 3 et incluant la region de codage de proteine indiquee dans le tableau V. 

5. Sequence d'ADN genomique selon la revendication 1 ou 2. 

6. Sequence d'ADN codant l'6rythropoietjne humaine selon la revendication 5. 

7. Sequence d'ADN selon la revendication 6 et incluant la region de codage de proteine indiquee dans le tableau VI. 

8. Sequence d'ADN selon la revendication 1 ou 2, associee de facon covalente k une substance de marquage de- 

10CI3DIB. 

9. Sequence d'ADN selon la revendication 8, dans laquelle le marqueur detectable est un radiomarqueur. 
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10. Sequence d'ADN simple brin selon la revendication 8 ou 9. 

11. Sequence d'ADN selon la revendication 1 codant pour [Phe^jEPO, [Phe^hEPO, [Phe 14 5lhEPO IHis'lhEPO 
[Asn2 des-Pro2 a l| e 6]hEPO, [des-Thr"* a Arg'66 ]hE p 0| ou [A27-55]hEPO. . 

12. Cellule note procaryote ou eucaryote transformee ou transfectee par une sequence d'ADN selon I'une quelconque 
des revendications 1 , 2, 3. 6, 7 et 8, d'une facon permettant a la cellule hole d'exprimer ledit produit polypeptidique. 

13. Cellule h6te transformee ou transfectee selon la revendication 12, laquelle cellule hdte est capable de qlycosyler 
ledrt polypeptide. a 1 ' 

14. Cellule h6te de mammifere transformee ou transfectee selon la revendication 13. 

15. Cellule COS transformee ou transfectee selon la revendication 1 3. 

16. Cellule CHO transformee ou transfectee selon la revendication 13. 

17. Vecteur d'ADN viral ou de plasmide circulaire biologiquement fonctionnel, incluant une sequence d'ADN selon une 
quelconque des revendications 1 , 2, 3, 5, 6, 7 ou 1 1 . 

18. Cellule hdte procaryote ou eucaryote transformee ou transfectee de f a9 on stable par un vecteur d'ADN selon la 
revendication 17. 

19. Polypeptide recombinant ayant tout ou partie de la conformation structured primaire de l'6rythropoTetine humaine 
ou de singe, comme indiqu6e dans le tableau VI ou le tableau V, ou tout variant alieiique ou derive de celui-ci 
possedant la propri6t6 biologique de provoquer, par les cellules de la moelle osseuse, I'augmentation de la pro- 
duction des reticulocytes et des globules rouges et I'augmentation de la synthase d'h6moglobine ou du captage 
de fer, et caract6ris6 en ce qu'il est le produrt de ('expression eucaryote d'une sequence d'ADN exogene et qui 
presente un pods mol6culaire plus 6lev6 par SDS-PAGE a partir de l'6rythropoT6tine iso!6e de sources urinaires. 

20. Polypeptide de glycoproteine selon la revendication 1 9 ayant une composition moyenne de carbohydrate qui difffcre 
de celle de 1 6rythropoi6tine humaine isolee de sources urinaires. 

21. Polypeptide selon la revendication 19 ou 20, dans lequel la sequence exog&ne est une sequence d'ADNc. 

22. Polypeptide selon la revendication 19 ou 20, dans lequel la sequence d'ADN exog6ne est une sequence d'ADN 
genomique. 

23. Polypeptide selon la revendication 19 ou 20, dans lequel la sequence d'ADN exogene est portee sur un vecteur 
viral ou de plasmide d'ADN circulaire se repliquant de facon autonome. 

24. Polypeptide selon I'une quelconque des revendications 19 a 23, caract6ris6 de plus en etant associe de facon 
covalente a une substance de marquage detectable. 

25. Polypeptide selon la revendication 24, dans lequel ledit marqueur detectable est un radiomarqueur. 

26. Produit polypeptidique de ('expression, dans une cellule hole eucaryote, d'une sequence d'ADN selon une quel- 
conque des revendications 1, 2, 3, 5 ,6 et 7. H - 

27. Precede pour produire un polypeptide ayant au moins une partie de la conformation structured primaire de 
I erythropo.etme pour lui permettre de posseder la propriete biologique de provoquer, par les cellules de la moelle 
osseuse I augmentation de la production des reticulocytes et des globules rouges, et I'augmentation de la synthese 
d hemoglobine ou du captage de fer, lequel proc6de est caract6rise par la mise en culture, dans des conditions 
nutrrtwes appropr.ees, d'une cellule hote procaryote ou eucaryote transformee ou transfectee avec une sequence 
d ADN selon une quetonque des revendications 1. 2, 3, 5, 6 et 7 d'une mani6re permettant a la cellule hfite 
d expnmer led,t polypeptide et, eventuellement, I'isolement du produit polypeptidique souhai.e de ('expression de 
ia sequence d ADN. 
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28. Precede selon la revendication 27, caracteris6 par la mise en culture d'une cellule hate selon I'uhe quelconque 
des revendications 12 a 16. M 

29 ' 5?t? ' a revendication 27 ou 28 P° ur P"*"™ un polypeptide selon rune quelconque des revendications 

la a 23 6t 26. 

30. Composition phamnaceutique comprenant un polypeptide produit conformement au procSde de la revendication 
27, 28 ou 29 et un support, adjuvant ou diluant pharmaceutiquement acceptable. 

31. Composition pharmaceutique selon la revendication 30, comprenant un polypeptide selon une quelconque des 
revendications 1 9 a 23 et 26. 
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